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PREFACE TO THE THIRD RUSSIAN: 
- EDITION 7 


IN THIS edition the book has been considerably augmented and revised, 
with the assistance of L. P. Pitaevskii throughout. 

New sections have been added on the magnetic properties of gases, the 
thermodynamics of a degenerate plasma, liquid crystals, the fluctuation 
theory of phase transitions of the second kind, and critical phenomena. 
The chapters on solids and on the symmetry of crystals have been substan- 
tially enlarged, in particular by a fuller account of the theory of irreducible 
representations of space groups as applied to the physics of the crystal 
state. The sections on the fluctuation—dissipation theorem have been revised 
and extended. 

Some sections have been removed from the book, dealing with the theory 
of quantum liquids and the related theory of almost ideal degenerate gases. 
The physics of quantum liquids, which was founded and largely developed 
by the pioneering experiments of P. L. Kapitza and the theoretical work of 
Landau himself, has now become a wide subject whose significance goes far 
beyond its original concern, the liquid helium isotopes. An account of the 
theory of quantum liquids must now occupy its rightful place in even a 
general course of theoretical physics, and the few sections given to it in the 
earlier editions of this book are insufficient. 

They will appear, in a considerably expanded form, in another volume of 
this course, now being prepared by Pitaevskii and myself, which will also 
give a detailed treatment of the Green’s function method and the diagram 
technique, which have largely determined the development of statistical 
physics in the last 20 years. The transfer of these (and some other) topics 
to a separate volume is dictated not only by the fact that their inclusion in 
the present one would make it too large and would considerably alter its 
whole character. There is also the reason that such topics are essentially 
akin to hydrodynamics and macroscopic electrodynamics; for example, 
in presenting the microscopic theory of superconductivity it is convenient 
to make use of the known macroscopic theory of this phenomenon. For this 
reason, the new volume will stand as one of the course, after Mechanics 
and Electrodynamics of Continuous Media. 

The first version of this book (which included only classical statistical 
physics) appeared in 1938. The reader of today may be surprised to find 
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that the use of the general Gibbs method in statistical physics even in the 
1930s called for reasoning such as is given in the extracts (reproduced below) 
from the preface to that book. Perhaps it was just in the development of the 
exposition of general principles and numerous applications of statistical 
physics that Landau most showed his astonishing breadth of grasp of the 
whole subject, his astonishing ability to discern the most direct and effective 
way of deriving every result of the theory, whether major or minor. 

Lastly, on behalf of L. P. Pitaevskif and myself, may I sincerely thank 
I. E. Dzyaloshinskil, I. M. Lifshitz and V. L. Pokrovskil for many discussions 
of matters arising in the revision of this book. 


Moscow E. M. LirsHirz 
May 1975 


FROM THE PREFACES TO PREVIOUS 
RUSSIAN EDITIONS 


Ir 1s a fairly widespread delusion among physicists that statistical physics 
is the least well-founded branch of theoretical physics. Reference is generally 
made to the point that some of its conclusions are not subject to rigorous 
mathematical proof; and it is overlooked that every other branch of theo- 
retical physics contains just as many non-rigorous proofs, although 
these are not regarded as indicating an inadequate foundation for such 
branches. 

Yet the work of Gibbs transformed the statistical physics of Clausius, 
Maxwell and Boltzmann into a logically connected and orderly system. 
Gibbs provided a general method, which is applicable in principle to all 
problems that can be posed in statistical physics, but which unfortunately 
has not been adequately taken up. The fundamental inadequacy of the 
majority of existing books on statistical physics is precisely that their authors, 
instead of taking this general method as a basis, give it only incident- 
ally. 

Statistical physics and thermodynamics together form a unit. All the 
concepts and quantities of thermodynamics follow most naturally, simply 
and rigorously from the concepts of statistical physics. Although the 
general statements of thermodynamics can be formulated non-statistically, 
their application to specific cases always requires the use of statistical 
physics. 

We have tried in this book to give a systematic account of statistical 
physics and thermodynamics together, based on the Gibbs method. All 
specific problems are statistically analysed by general methods. In the proofs, 
our aim has been not mathematical rigour, which is not readily attainable 
in theoretical physics, but chiefly to emphasise the interrelation of different 
physical statements. 

In the discussion of the foundations of classical statistical physics, we 
consider from the start the statistical distribution for small parts (sub- 
systems) of systems, not for entire closed systems. This is in accordance with 
the fundamental problems and aims of physical statistics, and allows a com- 
plete avoidance of the problem of the ergodic and similar hypotheses, which 
in fact is not important as regards these aims. 

An ideal gas is regarded as a particular case from the standpoint of general 
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methods, and we have therefore not described the Boltzmann method as 
such. This method cannot be independently justified; in particular, the use 
of a priori probabilities is difficult to justify. The Boltzmann expression for 
the entropy of an ideal gas is derived from the general formulae of the Gibbs 
method. 


L. D. LANDAU 
1937-9 E. M. Lirsuirz 


NOTATION 


OPERATORS are denoted by a circumflex. 


Mean values of quantities are denoted by a bar over the symbol or by angle 
brackets (see the footnote after (1.5)). 


Phase space 


P, q generalised momenta and coordinates 

dp dq = dp, dp,... dp, dq, dq2... dg, volume element in phase space (with s 
degrees of freedom) 

dl’ = dp dq/(2xh)} 

‘i ... dl’ integral over all physically different states 


Thermodynamic quantities 


T temperature 
V volume 

P pressure 

E energy 

S entropy 


W= E+PYV heat function 

F = E-TS free energy 

® = E—TS+PY thermodynamic potential 
Q = —PY thermodynamic potential 

Cy, Cy specific heats 

Cp» Cy Molecular specific heats 

number of particles 

chemical potential 

surface-tension coefficient 

area of interface 


wR E S 


In all formulae the temperature is expressed in energy units; the method of con- 
verting to degrees is described in footnotes to §§ 9 and 42. 


References to other volumes in the Course of Theoretical Physics: 

Mechanics = Vol. 1 (Mechanics, third English edition, 1976). 

Fields = Vol. 2 (The Classical Theory of Fields, fourth English edition, 1975) 

Quantum Mechanics = Vol. 3 (Quantum Mechanics, third English edition, 1977). 

RQT = Vol. 4 (Relativistic Quantum Theory, Part 1, English edition, 1971). 

Elasticity = Vol. 7 (Theory of Elasticity, second English edition, 1970). 

Electrodynamics = Vol. 8 (Electrodynamics of Continuous Media, English 
edition, 1960). 


All are published by Pergamon Press. 
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CHAPTER I 


THE FUNDAMENTAL PRINCIPLES 
OF STATISTICAL PHYSICS 


§ 1. Statistical distributions 


Statistical physics, often called for brevity simply statistics, consists in the 
study of the special laws which govern the behaviour and properties of mac- 
roscopic bodies (that is, bodies formed of a very large number of individual 
particles, such as atoms and molecules). To a considerable extent the general 
character of these laws does not depend on the mechanics (classical or quan- 
tum) which describes the motion of the individual particles in a body, but 
their substantiation demands a different argument in the two cases. For con- 
venience of exposition we shall begin by assuming that classical mechanics is 
everywhere valid. 

In principle, we can obtain complete information concerning the motion of 
a mechanical system by constructing and integrating the equations of motion 
of the system, which are equal in number to its degrees of freedom. But if we 
are concerned with a system which, though it obeys the laws of classical me- 
chanics, has a very large number of degrees of freedom, the actual application 
of the methods of mechanics involves the necessity of setting up and solving 
the same number of differential equations, which in general is impracticable. 
It should be emphasised that, even if we could integrate these equations in a 
general form, it would be completely impossible to substitute in the general 
solution the initial conditions for the velocities and coordinates of all the par- 
ticles. c 

At first sight we might conclude from this that, as the number of particles 
increases, so also must the complexity and intricacy of the properties of the 
mechanical system, and that no trace of regularity can be found in the behav- 
iour of a macroscopic body. This is not so, however, and we shall see below 
that, when the number of particles is very large, new types of regularity ap- 
pear. , 
These statistical laws resulting from the very presence of a large number of 
particles forming the body cannot in any way be reduced to purely mechani- 
cal laws. One of their distinctive features is that they cease to have meaning 
when applied to mechanical systems with a small number of degrees of 
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freedom. Thus, although the motion of systems with a very large number of 
degrees of freedom obeys the same laws of mechanics as that of systems con- 
sisting of a small number of particles, the existence of many degrees of free- 
dom results in laws of a different kind. 

The importance of statistical physics in many other branches of theoretical 
physics is due to the fact that in Nature we continually encounter macroscop- 
ic bodies whose behaviour can not be fully described by the methods of 
mechanics alone, for the reasons mentioned above, and which obey statistical 
laws. 

In proceeding to formulate the fundamental problem of classical statistics, 
we must first of all define the concept of phase space, which will be constantly 
used hereafter. 

Let a given macroscopic mechanical system have s degrees of freedom: 
that is, let the position of points of the system in space be described by s co- 
ordinates, which we denote by q,, the suffix i taking the values 1, 2, ..., 5. 
Then the state of the system at a given instant will be defined by the values at 
that instant of the s coordinates q, and the s corresponding velocities q,. In 
statistics it is customary to describe a system by its coordinates and momenta 
P, not velocities, since this affords a number of very important advantages. 
The various states of the system can be represented mathematically by points 
in phase space (which is, of course, a purely mathematical concept); the co- 
ordinates in phase space are the coordinates and momenta of the system con- 
sidered. Every system has its own phase space, with a number of dimensions 
equal to twice the number of degrees of freedom. Any point in phase space, 
corresponding to particular values of the coordinates g, and momenta p, of 
the system, represents a particular state of the system. The state of the system 
changes with time, and consequently the point in phase space representing 
this state (which we shall call simply the phase point of the system) moves 
along a curve called the phase trajectory. 

Let us now consider a macroscopic body or system of bodies, and assume 
that the system is closed, i.e. does not interact with any other bodies. A part 
of the system, which is very small compared with the whole system but stil! 
macroscopic, may be imagined to be separated from the rest; clearly, when 
the number of particles in the whole system is sufficiently large, the number 
in a small part of it may still be very large. Such relatively small but still 
macroscopic parts will be called subsystems. A subsystem is again a mechani- 
cal system, but not a closed one; on the contrary, it interacts in various 
ways with the other parts of the system. Because of the very large number of 
degrees of freedom of the other parts, these interactions will be very complex 
and intricate. Thus the state of the subsystem considered will vary with 
time in a very complex and intricate manner. 

An exact solution for the behaviour of the subsystem can be obtained only 
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by solving the mechanical problem for the entire closed system, i.e. by setting 
up and solving all the differential equations of motion with given initial con- 
ditions, which, as already mentioned, is an impracticable task. Fortunately, 
it is just this very complicated manner of variation of the state of subsystems 
which, though rendering the methods of mechanics inapplicable, allows a 
different approach to the solution of the problem. 

A fundamental feature of this approach is the fact that, because of the 
extreme complexity of the external interactions with the other parts of the 
system, during a sufficiently long time the subsystem considered will be many 
times in every possible state. This may be more precisely formulated as 
follows. Let Jp dg denote some small “volume” of the phase space of the 
subsystem, corresponding to coordinates g, and momenta p, lying in short 
intervals 4g, and Ap, We can say that, in a sufficiently long time T, the 
extremely intricate phase trajectory passes many times through each such 
volume of phase space. Let At be the part of the total time T during which the 
subsystem was in the given volume of phase space 4p 4q.* When the total 
time T increases indefinitely, the ratio At/T tends to some limit 

w= lim At/T. (1.1) 
T — co 
This quantity may clearly be regarded as the probability that, if the subsys- 
tem is observed at an arbitrary instant, it will be found in the given volume of 
phase space Ap Aq. 
On taking the limit of an infinitesimal phase volume* 


dg dp = dqidqe... dg; dpi dpe... dp,, (1.2) 


we can define the probability dw of states represented by points in this vol- 
ume element, i.e. the probability that the coordinates g, and momenta p, have 
values in given infinitesimal intervals between gq, p, and g,+4dg,, p;+dp;- 
This probability dw may be written 


dw = 0(P1, .. +5 Ds, Vs --- Ys) Ap dg, (1.3) 


where 0(p,, --+> Py G1» +--+» 9,) iS a function of all the coordinates and 
momenta; we shall usually write for brevity o(p, 7) or even @ simply. The 
function 9, which represents the “density” of the probability distribution in 
phase space, is called the statistical distribution function, or _§ simply the 


t For brevity, we shall usually say, as is customary, that the system “is in the 
volume Ap Aq of.phase space”, meaning that the system is in states represented by 
phase points in that volume. 

+ In what follows we shall always use the conventional notation dp and dq to 
denote the products of the differentials of all the momenta and all the coordinates 
of the system respectively. 
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distribution function, for the body concerned. This function must obviously 
satisfy the normalisation condition 


J edpdg = 1 (1.4) 


(the integral being taken over all phase space), which simply expresses the 
fact that the sum of the probabilities of all possible states must be unity. 
The following circumstance is extremely important in statistical physics. 
The statistical distribution of a given subsystem does not depend on the 
initial state of any other small part of the same system, since over a sufficient- 
ly long time the effect of this initial state will be entirely outweighed by the 
‘effect of the much larger remaining parts of the system. It is also > independent 


of the initial state of the particular small part considered, since in time this 
part passes through all possible states, any of which can be taken as the 
initial state. Without having to solve the mechanical problem for a system 
(taking account of initial conditions), we can therefore find the statistical 
distribution for small parts of the system. 

The determination of the statistical distribution for any subsystem is in 
fact the fundamental problem of statistical physics. In speaking of “small 
parts” of a closed system, we must bear in mind that the macroscopic bodies 
with which we have to deal are usually themselves such “small parts” of a 
large closed system consisting of these bodies together with the external 
medium which surrounds them. 

If this problem is solved and the statistical distribution for a given subsys- 
tem is known, we can calculate the probabilities of various values of any 
physical quantities which depend on the states of the subsystem (i.e. on the 
values of its coordinates g and momenta p). We can also calculate the mean 
value of any such quantity f(p, q), which is obtained by multiplying each of 
its possible values by the corresponding probability and integrating over all 
states. Denoting the averaging by a bar, we can write 


f= Sf. ely, 9) dp dg, (1.5) 


from which the mean values of various quantities can be calculated by using 
the statistical distribution function.‘ 

The averaging with respect to the distribution function (called statistical 
averaging) frees us from the necessity of following the variation with time of 
the actual value of the physical quantity f(p, q) in order to determine its mean 
value. It is"also obvious t that, by the definition (1.1) of the probability, the 
statistical averaging is exactly equivalent to a time averaging. The latter 


T In this book we shall denote averaging by a bar over the symbol or by angie 
brackets: f or f », being influenced in this solely by convenience in writing the for- 
mulae, The second way is preferable for writing the mean values of lengthy ex- 
pressions. 


§ 1 Statistical Distributions 5 


would involve following the variation of the quantity with time, establishing 
the function f = f(4, and determining the required mean value as 


f= lim + fr dt. 


9 


The foregoing discussion shows that the deductions and predictions con- 
cerning the behaviour of macroscopic bodies which are made possible by 
statistical physics are probabilistic. In this respect statistical physics differs 
from (classical) mechanics, the deductions of which are entirely deterministic. 
It should be emphasised, however, that the probabilistic nature of the results 
of classical statistics is not an inherent property of the objects considered, 
but simply arises from the fact that these results are derived from much less 
information than would be necessary for a complete mechanical description 
(the initial values of the coordinates and momenta are not needed). 

In practice, however, when statistical physics is applied to macroscopic 
bodies, its probabilistic nature is not usually apparent. The reason is that, 
if any macroscopic body (in_external conditions independent of. * time) is 
observed over a sufficiently long period of time, it is found that all physical 
quantities describing the body are practically constant (and equal to their 
mean values) and undergo appreciable changes relatively very rarely; we 
mean, of course, macroscopic quantities describing the body as a whole or 
macroscopic parts of it, but not individual particles.t This result, which is 
fundamental to statistical physics, follows from very general considerations 
(to be discussed in § 2) and becomes more and more nearly valid as the body 
considered becomes more complex and larger. In terms of the statistical 
distribution, we can say that, if by means of the function o(p, g) we construct 
the probability distribution function for various values of the quantity 
f(p, 9), this function will have an extremely sharp maximum for f = f, and 
will be appreciably different from zero only in the immediate vicinity of this 
point. 

Thus, by enabling us to calculate the mean values of quantities describing 
macroscopic bodies, statistical physics enables us to make predictions which 
are valid to very high accuracy for by far the greater part of any time interval 
which is long enough for the effect of the initial state of the body to be entire- 
ly eliminated. In this sense the predictions of statistical physics become 


+t We may give an example to illustrate the very high degree of accuracy with 
which this is true. If we consider a region in a gas which contains, say, only 1/100 
gram-molecule, we find that the mean relative variation of the energy of this quan- 
tity of matter from its mean value is only ~ 107". The probability of finding 
(in a single observation) a relative deviation of the order of 107, say, is given by a 
fantastically small number, ~ 1073%10", 
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practically determinate and not probabilistic. (For this reason, we shall hence- 
forward almost always omit the bar when using mean values of macroscopic 
quantities.) 

If a closed macroscopic system is in a state such that in any macroscopic 
subsystem the macroscopic physical quantities are to a high degree of 
accuracy equal to their mean values, the system is said to be in a state of 
statistical equilibrium (or thermodynamic or thermal equilibrium). It is seen 
from the foregoing that, if a closed macroscopic system is observed for a 
sufficiently long period of time, it will be in a state of statistical equilibrium 
for much the greater part of this period. If, at any initial instant, a closed 
macroscopic system was not in a State of statistical equilibrium (if, for ex- 
ample, it was artificially disturbed from such a state by means of an external 
interaction and then left to itself, becoming again a closed system), it will 
necessarily enter an equilibrium state. The time within which it will reach 
statistical equilibrium is called the relaxation time. In using the term “suffi- 
ciently long” intervals of time, we have meant essentially times long com- 
pared with the relaxation time. ; 

The theory of processes relating to the attainment of an equilibrium state 
is called kinetics. It is not part of statistical physics proper, which deals 
only with systems in statistical] equilibrium. 


§ 2. Statistical independence 


The subsystems discussed in § 1 are not themselves closed systems; on the 
contrary, they are subject to the continuous interaction of the remaining 
parts of the system. But since these parts, which are small in comparison with 
the whole of the large system, are themselves macroscopic bodies also, we can 
still suppose that over not too long intervals of time they behave approxi- 
mately as closed systems. For the particles which mainly takepart in the 
interaction of a subsystem with the surrounding parts are those near the 
surface of the subsystem; the relative number of such particles, compared 
with the total number of particles in the subsystem, decreases rapidly when 
the size of the subsystem increases, and when the latter is sufficiently large 
the energy of its interaction with the surrounding parts will be small in com- 
parison with its internal energy. Thus we may say that the subsystems are 
quasi-closed. It should be emphasised once more that this property holds 
only over not too long intervals of time. Over a sufficiently long interval 
of time, the effect of interaction of subsystems, however weak, will ultimately 
appear. Moreover, it is just this relatively weak interaction which leads 
finally to the establishment of statistical equilibrium. 

The fact that different subsystems may be regarded as weakly interacting 
has the result that they may also be regarded as statistically independent. 
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By statistical independence we mean that the state of one subsystem does not_. 
affect the probabilities of various states of the other subsystems. |. 

 Let-us consider any two subsystems, and let dp dg™ and dp™ dq® be 
volume elements in their phase spaces. If we regard the two subsystems 
together as one composite subsystem, then the statistical independence of the 
subsystems signifies mathematically that the probability of the composite 
subsystem’s being in its phase volume. element dp@ dg@” = dp dq. 
dp™ dq: can be written as the product of the probabilities for the two 
subsystems to be respectively in dp dg® and dp®dg®, each of these 
probabilities depending only on the coordinates and momenta of the sub- 
system concerned. Thus we can write 


pie dp@) dg = 91 dp dq. 0. dp® dq®, 


or 
012 = 0102, (2.1) 


where 012 is the statistical distribution of the composite subsystem, and 
01, Q2 the distribution functions of the separate subsystems. A similar relation 
is valid for a group of several subsystems.* 

The converse statement is clearly also true: if the probability distribution 
for a compound system is a product of factors, each of which depends only 
on quantities describing one part of the system, then the parts concerned 
are statistically independent, and each factor is proportional to the proba- 
bility of the state of the corresponding part. 

If f; and f2 are two physical quantities relating to two different subsystems, 
then from (2.1) and the definition (1.5) of mean values it follows immediately 
that the mean value of the product fife is equal to the product of the mean 
values of the quantities f; and f2 separately: 


fife= fifo (2.2) 


Let us consider a quantity f relating to a macroscopic body or to a part 
of it. In the course of time this quantity varies, fluctuating about its mean 
value. We may define a quantity which represents the average range of this 
fluctuation. The mean value of the difference Af = f—f is not suitable for 
this purpose, since the quantity / varies from its mean value in both directions, 
and the difference f—/, which is alternately positive and negative, has mean 
value zero regardless of how often f undergoes considerable deviations from 
its mean value. The required characteristic may conveniently be defined as 
the mean square of this difference. Since (Af)? is always positive, its mean 
value tends to zero only if (4f)? itself tends to zero; that is, the mean value 


t Provided, of course, that these subsystems together still form only a small part 
of the whole closed system. 
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is small only when the probability of considerable deviations of f from 
f is small. The quantity ((4f)*) is called the root-mean-square (r.m.s.) 
fluctuation of the quantity £, Multiplying out the square (f—/)* shows that 


(4) =7?7- (2.3) 


i.e. the r.m.s. fluctuation is determined by the difference between the mean 
square of the quantity and the square of its mean. 

The ratio ((Afy’)”/f is called ther relative fluctuation of the quantity /. The 
smaller this ratio is, the more negligible is the proportion of time during 
which the body is in states where the deviation of f from its mean value is a 
considerable fraction of the mean value. 

We shall show that the relative fluctuations of physical quantities decrease 
rapidly when the size of the bodies (that is, the number of particles) to which 
they relate increases. To prove this, we first note that the majority of quan- 
tities of physical interest are additive. This property is a consequence of the 
fact that the various parts of a body are quasi-closed systems, and signifies 
that the value of such a quantity for the whole body is the sum of its values 
for the various (macroscopic) parts of the body. For example, since the 
internal energies of these parts are, as shown above, large compared with 
their interaction energies, it is sufficiently accurate to assume that the energy 
of the whole body is equal to the sum of the energies of its parts. 

Let f be such an additive quantity. We imagine the body concerned to 
be divided into a large number N of approximately equal small parts. Then 


Ze Mi 


where the quantities f; relate to the individual parts of the body. 
It is clear that, as the size of the body increases, f increases approximately 
in proportion to N. Let us also determine the r.m.s. fluctuation of f, We have 


(apy) = (Eas) 


Because of the statistical independence of the different parts of the body, the 
mean values of the products 4/, Af, are 


AfiAfe = 4fr-dfc=0 (i # k), 
since each Af, = 0. Hence 


N 
(4p) = p> (Af). (2.4) 


It follows that, as N increases, the mean square ((4f)*) also increases in pro- 
portion to N. The relative fluctuation is therefore inversely proportional 
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to VN: 

Ca os 

f V/N 

On the other hand, if we consider a homogeneous body to be divided into 
parts of a given small size, it is clear that the number of parts will be propor- 
tional to the total number of particles (molecules) in the body. Hence the 
result can also be stated by saying that the relative fluctuation of any additive 
quantity f decreases inversely as the square root of the number of particles 
in a macroscopic body, and so, when the number of these is sufficiently large, 
the quantity fitself may be regarded as practically constant in time and equal 
to its mean value. This conclusion has already been used in§ 1. 


§ 3. Liouville’s theorem 


Let us now return to a further study of the properties of the statistical 
distribution function, and suppose that a subsystem is observed over a very 
long interval of time, which we divide into a very large (in the limit, infinite) 
number of equal short intervals between instants f1, fe, .... At each of 
these instants the subsystem considered is represented in its phase space by a 
point A;, Ae, .... The set of points thus obtained is distributed in phase 
space with a density which in the limit is everywhere proportional to the 
distribution function @(p, g). This follows from the significance of the latter 
function as determining the probabilities of various states of the subsystem. 

Instead of considering points representing states of one subsystem at 
different instants f1, f2, ..., we may consider simultaneously, in a formal 
manner, a very large (in the limit, infinite) number of exactly identical 
subsystems,’ which at some instant, say ¢ = 0, are in states represented by 
the points A;, Ao, ... . 

We now follow the subsequent movement of the phase points which rep- 
resent the states of these subsystems over a not too long interval of time, 
such that a quasi-closed subsystem may with sufficient accuracy be regarded 
as closed. The movement of the phase points will then obey the equations 
of motion, which involve the coordinates and momenta only of the particles 
in the subsystem. 

It is clear that at any instant ¢ these points will be distributed in phase 
space according to the same distribution function e(p, ¢), in just the same 
way as at ¢ = 0. In other words, as the phase points move about in the 
course of time, they remain distributed with a density which is constant at 
any given point and is proportional to the corresponding value of 0. 


t Such an imaginary set of identical systems is usually called a statistical 
ensemble, 
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This movement of phase points may be formally regarded as a steady flow 
of a “gas” in phase space of 2s dimensions, and the familiar equation of 
continuity may be applied, which expresses the constancy of the total number 
of “particles” (in this case, phase points) in the gas. The ordinary equation 
of continuity is 

00/dt+div (ev) = 0, 


where ¢ is the density and v the velocity of the gas. For steady flow, we have 
div (ev) = 0. 


For a space of 2s dimensions, this will become 


2s 
vo Ox; (ev;) = =" 


In the present case the “coordinates” x, are the coordinates g and momenta 
p, and the “velocities” v, = x, are the time derivatives g and p given by the 
equations of motion. Thus we have 


p) Va (og) + =— = (0n)| = 0 


Expanding the derivatives gives 


oo Ogi + OBi | 
= 0). . 3.1 
= laesa, aq? a oe > lear Sal oo 
With the equations of motion in Hamilton’s form: 
Gi = OH/Op;, pi = —dH/0q:, 


where H = H(p, q) is the Hamiltonian for the subsystem considered, we see 
that | 
04q;/0g: = 0°A/Oq; ODP; = —Op;/Op;. 


The second term in (3.1) is therefore identically zero. The first term is just 
the total time derivative of the distribution function. Thus 
de 00 oe ..\ _ 
oey (<2 jie i) =0. G2) 
We therefore reach the important conclusion that the distribution function 
is constant along the phase trajectories of the subsystem. This is Liouville’s 
theorem. Since quasi-closed subsystems are under dicussion, the result is 


valid only for not too long intervals of time, during which the subsystem 
behaves as if closed, to a sufficient approximation. 
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§ 4. The significance of energy 


It follows at once from Liouville’s theorem that the distribution function 
must be expressible entirely in terms of combinations of the variables p and 
q which remain constant when the subsystem moves as a closed subsystem. 
These combinations are the mechanical invariants or integrals of the motion, 
which are the first integrals of the equations of motion. We may therefore 
say that the distribution function, being a function of the mechanical invar- 
iants, is itself an integral of the motion. 

It proves possible to restrict very considerably the number of integrals 
of the motion on which the distribution function can depend. To do this, we 
must take into account the fact that the distribution 12 for a combination 
of two subsystems is equal to the product of the distribution functions 1 
and @2 of the two subsystems separately: 012 = 0102. Hence 


log g12 = log e1+]og 02, (4.1) 


so that the logarithm of the distribution function is an additive quantity. 
We therefore reach the conclusion that the logarithm of the distribution 
function must be.not merely an integral of the motion, but an additive inte- 
gral of the motion. 

As we know from mechanics, there exist only seven independent additive 
integrals of the motion: the energy, the three components of the momentum 
vector and the three components of the angular momentum vector. We shall 
denote these quantities for the ath subsystem (as functions of the coordinates 
and momenta of the particles in it) by E,(p, g), P,(p, g). M,(p. g) respectively. 
The only additive combination of thesé quantities is a linear combination 
of the form 


log Ca = %a +BE,(p, q)+ YoP.(p, q) a 6-M,(p, q) (4.2) 


with constant coefficients «,, 8, Y, 8, of which 8, ¥, & must be the same 
for all subsystems in a given closed system. 

We shall return in Chapter ITI to a detailed study of the distribution (4.2); 
here we need note only the following points. The coefficient «, is just the 
normalisation constant, given by the condition f 0, dp dg = 1. The con- 
stants 6, Y, 6, involving seven independent quantities altogether, may be 
determined from the seven constant values of the additive integrals of the 
motion for the whole closed system. Thus we reach a conclusion very im- 
portant in statistical physics. The values of the additive integrals of the mo- 
tion (energy, momentum and angular momentum) completely define the 
statistical properties of a closed system, i.e. the statistical distribution of any 
of its subsystems, and therefore the mean values of any physical quantities 
relating to them. These seven additive integrals of the motion replace the 
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unimaginable multiplicity of data (initial conditions) which would be 
required in the approach from mechanics. 

The above arguments enable us at once to set up a simple distribution 
function suitable for describing the statistical properties of a closed system. 
Since, as we have seen, the values of non-additive integrals of the motion do 
not affect these properties, the latter can be described by any function @ 
which depends only on the values of the additive integrals of the motion for 
the system and which satisfies Liouville’s theorem. The simplest such function 
is @ = constant for all points in phase space which correspond to given 
constant values of the energy (Eo), momentum (Po) and angular momentum 
(Mo) of the system (regardless of the values of the non-additive integrals) 
and 9 = O at all other points, It is clear that a function so defined will cer- 
tainly remain constant along a phase trajectory of the system, i.e. will satisfy 
Liouville’s theorem. 

This formulation, however, is not quite exact. The reason is that the 
points defined by the equations 


E(p,q) = Eo, P(p,q)= Po, M(p,¢g) = Mo (4.3) 


form a manifold of only 2s—7 dimensions, not 2s like the phase volume. 
Consequently, if the integral { @ dp dq is to be different from zero, the func- 
tion e(p, q) must become infinite at these points. The correct way of writing 
the distribution function for a closed system is 


0 = constant X 6(E— Eo) 6(P — Po) 6(M— Mb). (4.4) 


The presence of the delta functions’ ensures that 9 is zero at all points in 
phase space where one or more of the quantities E, P, M is not equal to the 
given value Eo, Po or Mo. The integral of 9 over the whole of a phase volume 
which includes all or part of the above-mentioned manifold of points is 
finite. The distribution (4.4) is called microcanonical.t 

The momentum and angular momentum of a closed system depend on its 
motion as a whole (uniform translation and uniform rotation). We can 
therefore say that the statistical state of a system executing a given motion 
depends only on its energy. In consequence, energy is of exceptional impor- 
tance in statistical physics. 


t The definition and properties of the delta function are given, for example, in 
Quantum Mechanics,§ 5. 


t it should be emphasised once more that this distribution is not the true sta- 
tistical distribution for a closed system. Regarding it as the true distribution is equiv- 
alent to asserting that, in the course of a sufficiently long time, the phase trajec- 
tory of a closed system passes arbitrarily close to every point of the manifold defined 
by equations (4.3). But this assertion (called the ergodic hypothesis) is certainly not 
true in general. 
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In order to exclude the momentum and angular momentum from the sub- 
sequent discussion we may use the following device. We imagine the system 
to be enclosed in a rigid “box” and take coordinates such that the “box” is 
at rest. Under these conditions the momentum and angular momentum are 
not integrals of the motion, and the only remaining additive integral of the 
motion is the energy. The presence of the “box”, on the other hand, clearly 
does not affect the statistical properties of small parts of the system (subsys- 
tems). Thus for the logarithms of the distribution functions of the subsystems, 
instead of (4.2), we have the still simpler expressions 


log Qg = %¢ + PE,(p, 9). (4.5) 
The microcanonical distribution for the whole system is 


o = constant 6(E—Ep). (4.6) 


So far we have assumed that the closed system as a whole is in statistical 
equilibrium ; that is, we have considered it over times long compared with its 
relaxation time. In practice, however, it is usually necessary to discuss a 
system over times comparable with or even short relative to the relaxation 
time. For large systems this can be done, owing to the existence of what are 
called partial (or incomplete) equilibria as well as the complete statistical 
equilibrium of the entire closed system. Such equilibria occur because the 
relaxation time increases with the size of the system, and so the separate 
small parts of the system attain the equilibrium state considerably more 
quickly than equilibrium is established between these small parts. This 
means that each small part of the system is described by a distribution func- 
tion of the form (4.2), with the parameters 8, Y, 6 of the distribution having 
different values for different parts. In such a case the system is said to be in 
partial equilibrium. In the course of time, the partial equilibrium gradually 
becomes complete, and the parameters 8, ¥, 6 for each small part slowly 
vary and finally become equal throughout the closed system. 

Another kind of partial equilibrium is also of frequent occurrence, namely 
that resulting from a difference in the rates of the various processes occurring 
in the system, not from a considerable difference in relaxation time between 
the system and its small parts. One obvious example is the partial equilibrium 
in a mixture of several substances involved in a chemical reaction. Owing to 
the comparative slowness of chemical reactions, equilibrium as regards the 
motion of the molecules will be reached, in general, considerably more rapid- 
ly than equilibrium as regards reactions of molecules, i.e. as regards the com- 
position of the mixture. This enables us to regard the partial equilibria of the 
mixture as equilibria at a given (actually non-equilibrium) chemical com- 
position. 
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The existence of partial equilibria leads to the concept of macroscopic 
states of asystem. Whereas a mechanical microscopic description of the system 
specifies the coordinates and momenta of every particle in the system, a 
macroscopic description is one which specifies the mean values of the physi- 
cal quantities determining a particular partial equilibrium, for instance the 
mean values of quantities describing separate sufficiently small but macroscop- 
ic parts of the system, each of which may be regarded as being in a separate 
equilibrium. eee Be Ee 
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Turning now to the distinctive features of quantum statistics, we may note 
first of all that the purely mechanical approach to the problem of determin- 
ing the behaviour of a macroscopic body in quantum mechanics is of course 
just as hopeless as in classical mechanics. Such an approach would require 
the solution of Schrédinger’s equation for a system consisting of all the par- 
ticles in the body, a problem still more hopeless, one might even say, than the 
integration of the classical equations of motion. But even if it were possible 
in some particular case to find a general solution of Schrédinger’s equation, 
it would be utterly impossible to select and write down the particular solution 
satisfying the precise conditions of the problem and specified by particular 
values of an enormous number of different quantum numbers. Moreover, we 
shall see below that for a macroscopic body the concept of stationary states 
itself becomes to some extent arbitrary, a fact of fundamental significance. 

Let us first elucidate some purely quantum-mechanical features of macro- 
scopic bodies as compared with systems consisting of a relatively small 
number of particles. 

These features amount to an extremely high density of levels in the energy 
eigenvalue spectrum of a macroscopic body. The reason for this is easily seen 
if we note that, because of the very large number of particles in the body, a 
given quantity of energy can, roughly speaking, be “distributed” in innumer- 
able ways among the various particles. The relation between this fact and the 
high density of levels becomes particularly clear if we take as an example a 
macroscopic body consisting of a “gas” of N particles which do not interact 
at all, enclosed in some volume. The energy levels of such a system are just 
the sums of the energies of the individual particles, and the energy of each 
particle can range over an infinite series of discrete values.‘ It is clear that, on 
choosing in all possible ways the values of the N terms in this sum, we shall 


t The separations between successive energy levels of a single particle are inverse- 
ly proportional to the square of the linear dimensions Z of the volume enclosing 
it (~ #?/mL?, where m is the mass of the particle and f the quantum constant). 
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obtain a very large number of possible values of the energy of the system in 
any appreciable finite part of the spectrum, and these values will therefore lie 
very close together. 

It may be shown (see (7.18)) that the number of levels in a given finite 
range of the energy spectrum of a macroscopic body increases exponentially 
with the number of particles in the body, and the separations between levels 
are given by numbers of the form 10—" (where N is a number of the order of 
the number of particles in the body), whatever the units, since a change in 
the unit of energy has no effect on such a fantastically small number.* 

In consequence of the extremely high density of levels, a macroscopic body 
in practice can never be in a strictly stationary state. First of all, it is clear 
that the value of the energy of the system will always be “broadened” by 
an amount of the order of the energy of interaction between the system and 
the surrounding bodies. The latter is very large in comparison with the sepa- 
rations between levels, not only for quasi-closed subsystems but also for 
systems which from any other aspect could be regarded as strictly closed. In 
Nature, of course, there are no completely closed systems, whose interaction 
with any other body is exactly zero; and whatever interaction does exist, 
even if it is so small that it does not affect other properties of the system, will 
still be very large in comparison with the infinitesimal intervals in the energy 
spectrum. 

In addition to this, there is another fundamental reason why a macroscopic 
body in practice cannot be in a stationary state. It is known from quantum 
mechanics that the state of a system described by a wave function is the 
result of some process of interaction of the system with another system 
which obeys classical mechanics to a sufficient approximation. In this re- 
spect the occurrence of a stationary state implies particular properties of 
the system. Here we must distinguish between the energy E of the system 
before the interaction and the energy E’ of the state which results from the 
interaction. The uncertainties AE and AE’ in the quantities E and E’ are 
related to the duration At of the interaction process by the formula 


|AE’—AE| ~ h/At; 


see Quantum Mechanics, § 44. The two errors JE and AE’ are in general of 
the same order of magnitude, and analysis shows that we cannot make 


t It should be mentioned that this discussion is inapplicable to the initial part 
of the energy spectrum; the separations between the first few energy levels of a 
macroscopic body may even be independent of the size of the body. This point, 
however, does not affect the subsequent conclusions: when referred to a single 
particle, the separations between the first few levels for a macroscopic body are 
negligibly small, and the high density of levels mentioned in the text is reached 
for very small values of the energy relative to a single particle. 
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AE’ « AE. We can therefore say that AE’ ~ h/At. In order that the state 
may be regarded as stationary, the uncertainty JE’ must certainly be small 
in comparison with the separations between adjoining levels. Since the latter 
are extremely small, we see that, in order to bring the macroscopic body into 
a particular stationary state, an extremely long time At ~ h/AE’ would be 
necessary. In other words, we again conclude that strictly stationary states 
of a macroscopic body cannot exist. 

To describe the state of a macroscopic body by a wave function at all is 
impracticable, since the available data concerning the state of such a body are 
far short of the complete set of data necessary to establish its wave function. 
Here the position is somewhat similar to that which occurs in classical statis- 
tics, where the impossibility of taking account of the initial conditions for 
every particle in a body makes impossible an exact mechanical description of 
its behaviour; the analogy is imperfect, however, since the impossibility of a 
complete quantum-mechanical description and the lack of a wave function 
describing a macroscopic body may, as we have seen, possess a much more 
profound significance. 

The quantum-mechanical description based on an incomplete set of data 
concerning the system is effected by means of what is called a density matrix; 
see Quantum Mechanics, § 14. A knowledge of this matrix enables us to 
calculate the mean value of any quantity describing the system, and also the 
probabilities of various values of such quantities. The incompleteness of the 
description lies in the fact that the results of various kinds of measurement 
which can be predicted with a certain probability from a knowledge of the 
density matrix might be predictable with greater or even complete certainty 
from a complete set of data for the system, from which its wave function 
could be derived. 

We shall not pause to write out here the formulae of quantum mechanics 
relating to the density matrix in the coordinate representation, since this 
representation is seldom used in statistical physics, but we shall show how the 
density matrix may be obtained directly in the energy representation, which is 
necessary for statistical applications. 

Let us consider some subsystem, and define its “stationary states” as the 
states obtained when all interactions of the subsystem with the surrounding 
parts of a closed system ate entirely neglected. Let y,(g) be the normalised 
wave functions of these states (without the time factor), g conventionally 
denoting the set of all coordinates of the subsystem, and the suffix n the set 
of all quantum numbers which distinguish the various stationary states; the 
energies of these states will be denoted by E,. 

Let us assume that at some instant the subsystem is in a completely de- 
scribed state with wave function y. The latter may be expanded in terms 
of the functions »,(q), which form a complete set; we write the ex- 
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pansion as 
p= 2, eniPn (5.1) 


The mean value of any quantity / in this state can be calculated from the 
coefficients c, by means of the formula 


ie py 2 CAC mT risn’s (5.2) 


where 
fam = | pat m aq (5.3) 


are the matrix elements of the quantity f(/ being the corresponding operator). 
The change from the complete to the incomplete quantum-mechanical 
description of the subsystem may be regarded as a kind of averaging over its 
various y states. In this averaging, the products c{c,, give a double set (two 
suffixes) of quantities, which we denote by w,,,, and which cannot be ex- 
pressed as products of any quantities forming a single set. The mean value 

of fis now given by 
f= » » Wasi Janis (5.4) 


The set of quantities w,,,, (which in general are functions of time) is the 
density matrix in the energy representation; in statistical physics it is called 
the statistical matrix.' 

If we regard the w,,,, aS the matrix elements of some statistical operator 
w, then the sum )° Wwf, Will be a diagonal matrix element of the operator 


n 
product #/, and the mean value f becomes the trace (sum of diagonal 
elements) of this operator: 


f= Lf Yan = tr (Vf). (5.5) 


This formula has the advantage of enabling us to calculate with any com- 
plete set of orthonormal wave functions: the trace of an operator is independ- 
ent of the particular set of functions with respect to which the matrix ele- 
ments are defined; see Quantum Mechanics, § 12. 


t The energy representation is mentioned here, as being the one generally used 
in statistical physics. We have not so far, however, made direct use of the fact that 
the tp, are wave functions of stationary states. It is therefore clear that the same 
method could be used to define the density matrix with respect to any complete 
set of wave functions. 

The usual coordinate density matrix 0(q, q’) (see Quantum Mechanics, § 14) is 
expressed in terms of the matrix w,,, by 


09, 7’) = > Wann) YPm{q)- 


3a 
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The other expressions of quantum mechanics which involve the quantities 
c, are similarly modified, the products c*c,, being everywhere replaced by 
the “averaged values” w,,, 


CrCm > Wmn- 


For example, the probability that the subsystem is in the nth state is equal 
to the corresponding diagonal element w,, of the density matrix (instead of 
the squared modulus c°c,). It is evident that these elements, which we shall 
denote by w,, are always positive: 


Wn = Wan > 0, (5.6) 
and satisfy the normalisation condition 
trrw= yw, = 1 (5.7) 


(corresponding to the condition }'|c,|? = ). 
n 


It must be emphasised that-the averaging over various y states, which we 
have used in order to illustrate the transition from a complete to an incom- 
plete quantum-mechanical description, has only a very formal significance. 
In particular, it would be quite incorrect to suppose that the description by 
means of the density matrix signifies that the subsystem can be in various » 
states with various probabilities and that the averaging is over these proba- 
bilities. Such a treatment would be in conflict with the basic principles of 
quantum mechanics. 

The states of a quantum-mechanical system that are described by wave 
functions are sometimes called pure states, as distrinct from mixed states, 
which are described by a density matrix. Care should, however, be taken not 
to misunderstand the latter term in the way indicated above. 

The averaging by means of the statistical matrix according to (5.4) has a 
twofold nature. It comprises both the averaging due to the probabilistic 
nature of the quantum description (even when as complete as possible) and 
the statistical averaging necessitated by the incompleteness of our informa- 
tion concerning the object considered. For a pure state only the first averag- 
ing remains, but in statistical cases both types of averaging are always pres- 
ent. It must be borne in mind, however, that these constituents cannot be 
separated; the whole averaging procedure is carried out as a single operation, 
and cannot be represented as the result of successive averagings, one purely 
quantum-mechanical and the other purely statistical. 

The statistical matrix in quantum statistics takes the place of the distribu- 
tion function in classical statistics. The whole of the discussion in the previ- 
ous sections concerning classical statistics and the, in practice, deterministic 
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nature of its predictions applies entirely to quantum statistics also. The proof 
given in § 2 that the relative fluctuations of additive physical quantities tend 
to zero as the number of particles increases made no use of any specific prop- 
erties of classical mechanics, and so remains entirely valid in the quantum 
case. We can therefore again assert that macroscopic quantities remain prac- 
tically equal to their mean values. 

In classical statistics the distribution function e(p, q) gives directly the 
probability distribution of the various values of the coordinates and mo- 
menta of the particles of the body. In quantum statistics this is no longer 
true; the quantities w, give only the probabilities of finding the body in a 
particular quantum state, with no direct indication of the values of the co- 
ordinates and momenta of the particles. oS 

From the very nature of quantum mechanics, the statistics based on it can 
deal only with the determination of the probability distribution for the co- 
ordinates and momenta separately, not together, since the coordinates and 
momenta of a particle cannot simultaneously have definite values. The re- 
quired probability distributions must reflect both the statistical uncertainty 
and the uncertainty inherent in the quantum-mechanical description. To find 
these distributions, we repeat the arguments given above. We first assume 
that the body is in a pure quantum state with the wave function (5.1). The 
probability distribution for the coordinates is given by the squared modulus 


pl? = dD CaCm PnP » 


so that the probability that the coordinates have values in a given interval 
dq = dq, dg, ... dq, is dw, = | y|? dg. For a mixed state, the products c*c,, 
are replaced by the elements w,,, of the statistical matrix, and |y|? thus 
becomes 

2 2 WiniPam . 


By the definition of the matrix elements, 
»» WinnYmn = WYn ) 
and so 


»» » Winn m = »» paw Pra 


Thus we have the following formula for the coordinate probability distri- 


bution: 
dwq = Dyan dq. . (5.8) 


In this expression the functions y, may be any complete set of normalised 
wave functions. 
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Let us next determine the momentum probability distribution. The quan- 
tum states in which all the momenta have definite values correspond to free 
motion of all the particles. We denote the wave functions of these states by 
~,(q), the suffix p conventionally representing the set of values of all the mo- 
menta. As we know, the diagonal elements of the density matrix are the prob- 
abilities that the system is in the corresponding quantum states. Hence, 
having determined the density matrix with respect to the set of functions y,, 
we obtain the required momentum probability distribution from the for- 
mula‘ 

dWp = Wop dp = dp: | yprinry dq, (5.9) 
where dp = dp, dp, ... dp,. 

It is interesting that both distributions (coordinate and momentum) can 
be obtained by integrating the same function 


I(q, P) = yp(9) Wyp(Q). (5.10) 


Integration of this expression with respect to q gives the momentum distri- 
bution (5.9). Integration with respect to p gives 


dw, = dg f y3(q) oyp(9) dp, (5.11) 


in agreement with the general definition (5.8). Note also that the function 
(5.10) can be expressed in terms of the coordinate density matrix o(g, q/) by 


1(q, p) = yp(9) | 0(4, 9’) vol’) dq’ . (5.12) 


It must be emphasised, however, that this does not at all imply that the 
function J(q, p) may be regarded as a probability distribution for coordi- 
nates and momenta simultaneously; the expression (5.10) is in any case com- 
plex, quite apart from the fact that such a treatment would altogether 
contradict the fundamental principles of quantum mechanics.? 


t The functions w,(q) are plane waves in the configuration space of the system; 
they are assumed normalised by the delta function of all the momenta. 


t Since Iq, p) has no direct physical significance, the definition of the function 
with the properties stated is of course not unique. For example, the g and p 
distributions can be obtained by the same method from the function 


Tyha,p)= f o(at+36,q—46) vi(at+4é) v(a—8) dé, (5.10) 


where & denotes the set of auxiliary variables ¢,,...,&, and d& = dé, ... dé, 
(E. P. Wigner, 1932): since 

J v3(a+2£) volo — 28) dp = 6(g+38—-4+ 28) = 66, 
the integral J Iw dp = o(@, q). The integral j Ty dq, after the change of variables 
qtié > 4, q—st ~ q’, is the same as f I dq. Unlike 7(q, p), Iyq, p) is real (as can 


be easily seen by using the fact that the matrix e(g, q’) is Hermitian), but in general 
it is not everywhere positive. 
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§ 6. Statistical distributions in quantum statistics 


In quantum mechanics a theorem can be proved which is analogous to 
Liouville’s theorem derived in § 3 on the basis of classical mechanics. 

To do this, we first derive a general equation of quantum mechanics which 
gives the time derivative of the statistical matrix of any (closed) system.* 
Following the method used in § 5, we first assume that the system is in a pure 
state with a wave function represented in the form of a series (5.1). Since the 
system is closed, its wave function will have the same form at all subsequent 
instants, but the coefficients c, will depend on the time, being proportional to 
factors e~!#="*, We therefore have 


er ae = * 
i (CiCm) = yen Em Crem > 


The change to the statistical matrix in the general case of mixed states is now 
effected by replacing the products cic, DY Wm» and this gives the required 


equation: 
Winn = (i/f) (E,—Em)Winn- (6.1) 


This equation can be written in a general operator form by noticing that 


(E, —E Winn = »y WriHin = AmtWin), 


where H,,,, are the matrix elements of the Hamiltonian 7 of the system; this 
matrix is diagonal in the energy representation, which we are using. Hence 


w= (i/f) (WAH). (6.2) 


It should be pointed out that this expression differs in sign from the usual 
quantum-mechanical expression for the operator of the time derivative of a 
quantity. 

We see that, if the time derivative of the statistical matrix is zero, the oper- 
ator w must commute with the Hamiltonian of the system. This result is the 
quantum analogue of Liouville’s theorem: in classical mechanics the require- 
ment of a stationary distribution function has the result that w is an integral 
of the motion, while the commutability of the operator of a quantity with the 
Hamiltonian is just the condition, in quantum mechanics, that that quantity 
is conserved. 

In the energy representation used here, the condition is particularly simple: 
(6.1) shows that the matrix w,,,, must be diagonal, again in accordance with 


t In § 5 the density matrix of a subsystem was discussed, having regard to its 
fundamental applications in statistical physics, but a density matrix can of course 
also be used to describe a closed system in a mixed state. 
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the usual matrix condition that a quantity is conserved in quantum mechan- 
ics, namely that the matrix of such a quantity can be brought to diagonal 
form simultaneously with the Hamiltonian. 

As in§ 3, wecan now apply the results obtained to quasi-closed sainepadinn: 
for intervals of time during which they behave to a sufficient approximation 
as closed systems. Since the statistical distributions (or in this case the statis- 
tical matrices) of subsystems must be stationary, by the definition of statis- 
tical equilibrium, we first of all conclude that the matrices w,,,, are diagonal 
for all subsystems.! The problem of determining the statistical distribution 
therefore amounts to a calculation of the probabilities w, = w,,,. which 
represent the “distribution function” in quantum statistics. Formula (5.4) 
for the mean value of any quantity f becomes simply 


= 2 Wifuns (6.3) 


and contains only the diagonal matrix elements /,,. 

Next, using the facts that w must be a quantum-mechanical integral of 
the motion and that the subsystems are quasi-independent, we find in a 
similar way to the derivation of (4.5) that the logarithm of the distri- 
bution function for subsystems must be of the form 


log w = o@ 4 REO, (6.4) 


where the index a corresponds to the various subsystems. Thus the probabil- 
ities w, can be expressed as a function of the energy level alone: w, = w(E,). 

Finally, the discussion in § 4 concerning the significance of additive inte- 
grals of the motion, and in particular the energy, as determining all the statis- 
tical properties of a closed system, remains entirely valid. This again enables 
us to set up for a closed system a simple distribution function suitable for 
describing its statistical properties though (as in the classical case) certainly 
not the true distribution function. 

To formulate mathematically this “quantum microcanonical distribution” 
we must use the following device. The energy spectra of macroscopic bodies 
being “almost continuous”, we make use of the concept of the number of 
quantum states of a closed system which “belong” to a particular infinitesi- 
mal range of values of its energy.t We denote this number by dJ’; it plays a 


t Since this statement involves neglecting the interactions between subsystems, 
it is more precise to say that the non-diagonal elements w,,,, tend to zero as the 
relative importance of these interactions decreases, and therefore as the number 
of particles in the subsystems increases. 

t It will be remembered that in § 4 we agreed to ignore entirely the momentum 
and angular momentum of the system as a whole, for which purpose it is sufficient 
to consider a system enclosed in a rigid “box” with coordinates such that the box 
is at rest. 


§7 Entropy 23 


part analogous to that of the phase volume element dp dq in the classical 
case. 

If we regard a closed system as consisting of subsystems, and neglect the 
interaction of the latter, every state of the whole system can be described by 
specifying the states of the individual subsystems, and the number dJ is a 


product 
a’ = [J] dl, (6.5) 


of the numbers dI°, of the quantum states of the subsystems (such that the 
sum of the energies of the subsystems lies in the specified interval of energy 
of the whole system). 

We can now formulate the microcanonical distribution analogously to the 
classical expression (4.6), writing 


dw = constant X 6(E—Eo) [[ dl, (6.6) 


for the probability dw of finding the system in any of the dJ' states. 
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Let us consider a closed system for a period of time long compared with 
its relaxation time; this implies that the system is in complete statistical 
equilibrium. 

The following discussion will be given first of all for quantum statistics. 
Let us divide the system into a large number of macroscopic parts (sub- 
systems) and consider any one of them. Let w,, be the distribution function 
for this subsystem ; to simplify the formulae we shall at present omit from w, 
(and other quantities) the suffix indicating the subsystem. By means of the 
function w, we can, in particular, calculate the probability distribution of 
the various values of the energy E of the subsystem. We have seen that w,, 
may be written as a function of the energy alone, w, = w(£,). In order to 
obtain the probability W(Z)d£ that the subsystem has an energy between 
E and E+dE, we must multiply w(Z) by the number of quantum states with 
energies in this interval; here we use the same idea of a “broadened” energy 
Spectrum as was mentioned at the end of § 6. Let /'(E) denote the number of 
quantum states with energies less than or equal to E. Then the required 
number of states with energy between E and E+dE can be written 

dI(E) 
de 


and the energy probability distribution is 
v 
we) = {2 wey, (7.1) 
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The normalisation condition 
JW(EdE=1 


signifies geometrically that the area under the curve W = W(E) is unity. 

In accordance with the general statements in § 1, the function W(£) has a 
very sharp maximum at E = E, being appreciably different from zero only 
in the immediate neighbourhood of this point. We may define the “width” 
AE of the curve W = W(E)as the width of a rectangle whose height is equal 
to the value of the function W(E) at the maximum and whose area is unity: 


W(E) AE = 1. (7.2) 
Using the expression (7.1), we can write this definition as 
WE)AL = 1, (7.3) 
where 
A= vo AE (7.4) 


is the number of quantum states corresponding to the interval JE of energy. 
The quantity AI thus defined may be said to represent the “degree of broad- 
ening” of the macroscopic state of the subsystem with respect to its micro- 
scopic states. The interval AE is equal in order of magnitude to the mean 
fluctuation of energy of the subsystem. 

These definitions can be immediately applied to classical statistics, except 
that the function w(£) must be replaced by the classical distribution function 
oe, and AI by the volume of the part of phase space defined by the formula 


(FE) Ap Ag = 1. (7.5) 


The phase volume Ap Aq, like AI’, represents the size of the region of phase 
space in which the subsystem will almost always be found. 

It is not difficult to establish the relation between AJ" in quantum theory 
and Ap Aq in the limit of classical theory. In the quasi-classical case, a 
correspondence can be set up between the volume of a region of phase 
space and the “corresponding” number of quantum states (see Quantum 
Mechanics, § 48): we can say that a “cell” of volume (27h) (where s is the 
number of degrees of freedom of the system) “corresponds” in phase space 
to each quantum state. It is therefore clear that in the quasi-classical case 
the number of states AJ" may be written 


AI = Ap Aq|(2nh):, (7.6) 


where s is the number of degrees of freedom of the subsystem considered. 
This formula gives the required relation between AI‘ and Ap Aq. 
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The quantity AI is called the statistical weight of the macroscopic state of 
the subsystem, and its logarithm 


S = log Ar (7.7) 


is called the entropy of the subsystem. In the case of classical statistics the 
corresponding expression is 

S = log on (7.8) 
The entropy thus defined is dimensionless, like the statistical weight itself. 
Since the number of states AI is not less than unity, the entropy cannot be 
negative. The concept of entropy is one of the most important in statistical 
physics. 

It is apposite to mention that, if we adhere strictly to the standpoint of 
classical statistics, the concept of the “number of microscopic states” cannot 
be defined at all, and we should have to define the statistical weight simply as 
Ap Aq. But this quantity, like any volume in phase space, has the dimensions 
of the product of s momenta and s coordinates, i.e. the sth power of action 
((erg-sec)’). The entropy, defined as log Ap Aq, would then have the peculiar 
dimensions of the logarithm of action. This means that the entropy would 
change by an additive constant when the unit of action changed: if the unit 
were changed by a factor a, 4p 4q would become a‘ Ap Aq, and log Ap Aq 
would become log Ap 4q +s log a. In purely classical statistics, therefore, the 
entropy is defined only to within an additive constant which depends on the 
choice of units, and only differences of entropy, i.e. changes of entropy in a 
given process, are definite quantities independent of the choice of units. 

This accounts for the appearance of the quantum constant # in the defi- 
nition (7.8) of the entropy for classical statistics. Only the concept of the 
number of discrete quantum states, which necessarily involves a non-zero 
quantum constant, enables us to define a dimensionless statistical weight and 
So to give an unambiguous definition of the entropy. 

We may write the definition of the entropy in another form, expressing 
it directly in terms of the distribution function. According to (6.4), the loga- 
rithm of the distribution function of a subsystem has the form 


log w(E,,) = «+ BEp. 
Since this expression is linear in E,, the quantity 
log w(£) = «+BFE 


can be written as the mean value (log w(E,)). The entropy S = log AI’ = 
= — log w(F) (from (7.3)) can therefore be written 


S= —(log W(En))» (7.9) 
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i.e. the entropy can be defined as minus the mean logarithm of the distribu- 
tion function of the subsystem. From the significance of the mean value, 


S =—)' w, log wa; (7.10) 


this expression can be written in a general operator form independent of the 
choice of the set of wave functions with respect to which the statistical 
matrix elements are defined :' 


= —tr (# log w). (7.11) 
Similarly, in classical statistics, the definition of the entropy can be written 


S = —(log [(2xf) @]) 
=~ f @ log [(2nh) 0] dp dg. (7.12) 


Let us now return to the closed system as a whole, and let 4/1, 4s, ... 
be the statistical weights of its various subsystems. If each of the subsystems 
can be in one of AI, quantum states, this gives 


AD =| AT, (7.13) 


as the number of different states of the whole system. This is called the statis- 
tical weight of the closed system, and its logarithm is the entropy S of the 


system. Clearly 
S=>)S,, (7.14) 


i.e. the entropy thus defined is additive: the entropy of a composite system is 
equal to the sum of the entropies of its parts. 

For a clear understanding of the way in which entropy is defined, it is 
important to bear in mind the following point. The entropy of a closed sys- 
tem (whose total energy we denote by Eo) in complete statistical equilibrium 
can also be defined directly, without dividing the system into subsystems. 
To do this, we imagine that the system considered is actually only a small 
part of a fictitious very large system (called in this connection a thermostat or 
heat bath). The thermostat is assumed to be in complete equilibrium, in such a 
way that the mean energy of the system considered (which is now a non- 
closed subsystem of the thermostat) is equal to its actual energy Eo. Then we 
can formally assign to the system a distribution function of the same form as 
for any subsystem of it, and by means of this distribution determine its statis- 


t In accordance with the general rules, the operator log w must be understood 
as an operator whose eigenvalues are equal to the logarithms of the eigenvalues of 
the operator #, and whose eigenfunctions are the same as those of ¥. 
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tical weight 4’, and therefore the entropy, directly from the same formulae 
(7.3)-(7.12) as were used for subsystems. It is clear that the presence of the 
thermostat has no effect on the statistical properties of individual small parts 
(subsystems) of the system considered, which in any case are not closed and 
are in equilibrium with the remaining parts of the system. The presence of 
the thermostat therefore does not alter the statistical weights AI’, of these 
parts, and the statistical weight defined in the way just described will be the 
same as that previously defined as the product (7.13). 

So far we have assumed that the closed system is in complete statistical 
equilibrium. We must now generalise the above definitions to systems in 
arbitrary macroscopic states (partial equilibria). 

Let us suppose that the system is in some state of partial equilibrium, and 
consider it over time intervals 4t which are small compared with the relaxa- 
tion time for complete equilibrium. Then the entropy must be defined as 
follows. We imagine the system divided into parts so small that their re- 
spective relaxation times are small compared with the intervals 4t(remember- 
ing that the relaxation times in general decrease with decreasing size of the 
system). During the time At such parts may be regarded as being in their own 
particular equilibrium states, described by certain distribution functions. 
We can therefore apply to them the previous definition of the statistical 
weights AI”, and so calculate their entropies S,. The statistical weight AI’ of 
the whole system is then defined as the product (7.13), and the corresponding 
entropy S as the sum of the entropies S,. 

It should be emphasised, however, that the entropy of a non-equilibrium 
system, defined in this way as the sum of the entropies of its parts (satis- 
fying the above condition), cannot now be calculated by means of the ther- 
mostat concept without dividing the system into parts. At the same time this 
definition is unambiguous in the sense that further division of the subsystems 
into even smaller parts does not alter the value of the entropy, since each 
subsystem is already in “complete” internal equilibrium. 

In particular, attention should be drawn to the significance of time in the 
definition of entropy. The entropy is a quantity which describes the average 
properties of a body over some non-zero interval of time At. If At is given, to 
determine S we must imagine the body divided into parts so small that their 
relaxation times are small in comparison with At. Since these parts must also 
themselves be macroscopic, it is clear that when the intervals At are too short 
the concept of entropy becomes meaningless; in particular, we cannot speak 
of its instantaneous value. 

Having thus given a complete definition of the entropy, let us now ascer- 
tain the most important properties and the fundamental physical significance 
of this quantity. To do so, we must make use of the microcanonical distribu- 
tion, according to which a distribution function of the form (6.6) may be used 
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to describe the statistical properties of a closed system: 


dw = constant X 6(E—Ep): [] dfa. 


Here dl”, may be taken as the differential of the function I\,(Z,), which rep- 
resents the number of quantum states of a subsystem with energies less than 
or equal to E,. We can write dw as 


dw = constant x 6(E—E)- [] (d/./dE,) dE. (7.15) 


The statistical weight 4I”, by definition, is a function of the mean energy 
E, of the subsystem; the same applies to S, = S,(£,). Let us now formally re- 
gard AJ", and S, as functions of the actual energy EZ, (the same functions as 
they really are of E,). Then we can replace the derivatives dJ",(£,)/dE, in 
(7.15) by the ratios 4I’,/AE,, where AI’, is a function of £, in this sense, and 
AE, the interval of energy corresponding to AI, (also a function of E£,). Fi- 
nally, replacing AI‘, by e*), we obtain 


dw = constant X 6(E—Ep)eS [| dE./4E, (7.16) 


where 
S = ¥ Sa(E,) 


is the entropy of the whole closed system, regarded as a function of the exact 
values of the energies of its parts. The factor e*, whose exponent is an addi- 
tive quantity, is a very rapidly varying function of the energies E,. In compari- 
son with this function, the energy dependence of the quantity J7 AE, is quite 
unimportant, and we can therefore replace (7.16) with very high accuracy by 


dw = constant x 6(E—Ep)e5 [] dEa. (7.17) 


But dw expressed in a form proportional to the product of all the differen- 
tials dE, is just the probability that all the subsystems have energies in given 
intervals between E, and E,+d£,. Thus we see that this probability is deter- 
mined by the entropy of the system as a function of the energies of the sub- 
systems; the factor 6(£—Z£,) ensures that the sum E = ZE, has the given 
value Eo of the energy of the system. This property of the entropy, as we 
shall see later, is the basis of its applications in statistical physics. 

We know that the most probable values of the energies Z, are their mean 
values £,. This means that the function S(E,, E,, ...) must have its maxi- 
mum possible value (for a given value of the sum XE, = E,) when E, = £,. 
But the E, are just the values of the energies of the subsystems which corres- 
pond to complete statistical equilibrium of the system. Thus we reach the 
important conclusion that the entropy of a closed system in a state of 
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complete statistical equilibrium has its greatest possible value (for a given 
energy of the system). 

Finally, we may mention another interesting interpretation of the function 
S = S(E), the entropy of any subsystem or closed system; in the latter case it 
is assumed that the system is in complete equilibrium, so that its entropy 
may be expressed as a function of its total energy alone. The statistical 
weight AI’ = e*), by definition, is the number of energy levels in the inter- 
val AE which describes in a certain way the width of the energy probability 
distribution. Dividing AE by AI’, we obtain the mean separation between 
adjoining levels in this interval (near the energy E) of the energy spectrum 
of the system considered. Denoting this distance by D(E), we can write 


D(E) = AE-e~S®), (7.18) 


Thus the function S(£) determines the density of levels in the energy spec- 
trum of a macroscopic system. Since the entropy is additive, we can say that 
the mean separations between the levels of a macroscopic body decrease 
exponentially with increasing size of the body (i.e. with increasing number of 
particles in it). 


§ 8. The law of increase of entropy 


If a closed system is not in a state of statistical equilibrium, its macroscop- 
ic state will vary in time, until ultimately the system reaches a state of 
complete equilibrium. If each macroscopic state of the system is described 
by the distribution of energy between the various subsystems, we can say 
that the sequence of states successively traversed by the system corresponds 
to more and more probable distributions of energy. This increase in proba- 
bility is in general very considerable, because it is exponential, as shown in 
§ 7. We have seen that the probability is given by e*, the exponent being an 
additive quantity, the entropy of the system. We can therefore say that the 
processes occurring in a non-equilibrium closed system do so in such a way 
that the system continually passes from states of lower to those of higher 
entropy until finally the entropy reaches the maximum possible value, cor- 
responding to complete statistical equilibrium. 

Thus, if a closed system is at some instant in a non-equilibrium macroscopic 
state, the most probable consequence at later instants is a steady increase 
in the entropy of the system. This is the law of increase of entropy or 
second law of thermodynamics, discovered by R. Clausius (1865); its statisti- 
cal explanation was given by L. Boltzmann in the 1870s. 

In speaking of the “most probable” consequence, we must remember that 
in reality the probability of transition to states of higher entropy is so enor- 
mous in comparison with that of any appreciable decrease in entropy that 
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in practice the latter can never be observed in Nature. Ignoring decreases 
in entropy due to negligible fluctuations, we can therefore formulate the law 
of increase of entropy as follows: if at some instant the entropy of a closed 
system does not have its maximum value, then at subsequent instants the 
entropy will not decrease; it will increase or at least remain constant. 

There is no doubt that the foregoing simple formulations accord with 
reality; they are confirmed by all our everyday observations. But when we 
consider more closely the problem of the physical nature and origin of these 
laws of behaviour, substantial difficulties arise, which to some extent have 
not yet been overcome. 

Firstly, if we attempt to apply statistical physics to the entire Universe, 
regarded as a single closed system, we immediately encounter a glaring 
contradiction between theory and experiment. According to the results of 
statistics, the universe ought to bein a state of complete statistical equilib- 
rium. More precisely, any finite region of it, however large, should have a 
finite relaxation time and should be in equilibrium. Everyday experience 
shows us, however, that the properties of Nature bear no resemblance to 
those of an equilibrium system; and astronomical results show that the 
same is true throughout the vast region of the Universe accessible to our 
observation. 

The escape from this contradiction is to be sought in the genera! theory 
of relativity. The reason is that, when large regions of the Universe are con- 
sidered, the gravitational fields present become important. These fields 
are just a change in the space-time metric. When the statistical properties 
of bodies are discussed, the metric properties of space-time may in a sense 
be regarded as “external conditions” to which the bodies are subject. The 
statement that a closed system must, over a sufficiently long time, reach 
a state of equilibrium, applies of course only to a system in steady external 
conditions. On the other hand, the general cosmological expansion of the 
Universe means that its metric depends essentially on time, so that the 
“external conditions” are by no means steady in this case. Here it is impor- 
tant that the gravitational field cannot itself be included in a closed system, 
Since the conservation laws which are, as we have seen, the foundation of 
statistical physics would then reduce to identities. For this reason, in the 
general theory of relativity, the Universe as a whole must be regarded not 
as a closed system but as a system in a variable gravitational field. Conse- 
quently the application of the law of increase of entropy does not prove 
that statistical equilibrium must necessarily exist. 

Thus this aspect of the problem of the Universe as a whole indicates the 
physical basis of the apparent contradictions. There are, however, other 
difficulties in understanding the physical nature of the law of increase of en- 


tropy. 
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Classical mechanics itself is entirely symmetrical with respect to the two 
directions of time. The equations of mechanics remain unaltered when the 


time ¢ is replaced by —1; if these equations allow any particular motion, 


they will therefore allow the reverse motion, in which the mechanical system 
passes through the same configurations in the reverse order. This symmetry 
must naturally be preserved in a statistics based on classical mechanics. 
Hence, if any particular process is possible which is accompanied by an 
increase in the entropy of a closed macroscopic system, the reverse process 
must also be possible, in which the entropy of the system decreases. The 
formulation of the law of increase of entropy given above does not itself 
contradict this symmetry, since it refers only to the most probable conse- 
quence of a macroscopically described state. In other words, if some non- 
equilibrium macroscopic state is given, the law of increase of entropy asserts 
only that, out of all the microscopic states which meet the given macroscop- 
ic description, the great majority lead to an increase of entropy at subse- 
quent instants. 

A contradiction arises, however, if we look at another aspect of the prob- 
lem. In formulating the law of increase of entropy, we have referred to the 
most probable consequence of a macroscopic state given at some instant. 
But this state must itself have resulted from some other states by means of 
processes occurring in Nature. The symmetry with respect to the two direc- 
tions of time means that, in any macroscopic state arbitrarily selected at 
some instant ¢ = fo, we can say not only that much the most probable con- 
sequence at ¢ > fo is an increase in entropy, but also that much the most 
probable origin of the state was from states of greater entropy; that is, 
the presence of a minimum of entropy as a function of time at the arbitra- 
rily chosen instant f = fo is much the most probable.' 


t For a better understanding of this symmetry, we may plot diagrammatically the 
variation of the entropy of a system that is closed during a very long interval of 
time (Fig. 1). Let a macroscopic state with entropy S = S, < Snax be observed in 


Kia. 1 


such a system, arising from a (very improbable) large fluctuation. Then we can say 
that it will be, with very high probability, a point of type 1, at which the entropy 
has reached a minimum, and not one of type 2, at which the entropy will decrease 
further. 
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This assertion, of course, is not at all equivalent to the law of increase of 
entropy, according to which the entropy never decreases (apart from en- 
tirely negligible fluctuations) in any closed systems which actually occur in 
Nature. And it is precisely this general formulation of the law of increase 
of entropy which is confirmed by all natural phenomena. It must be empha- 
sised that it is certainly not equivalent to the formulation given at the begin- 
ning of this section, as it might appear to be. In order to derive one formu- 
. lation from the other, it would be necessary to use the concept of an observer 
who artificially “creates” a closed system at some instant, so that the prob- 
lem of its previous behaviour does not arise. Such a dependence of the 
laws of physics on the nature of an observer is quite inadmissible, of course. 

It is doubtful whether the law of increase of entropy thus formulated could 
be derived on the basis of classical mechanics. Moreover, because of the 
invariance of the equations of classical mechanics under time reversal, one 
could seek only to derive a monotonic variation of entropy. In order to ob- 
tain a law of monotonic increase, we should have to define the direction of 
time as that in which the entropy increases. The problem would then arise 
of proving that such a thermodynamic definition was identical with the 
quantum-mechanical definition (see below). 

In quantum mechanics, the situation is substantially changed. The funda- 
mental equation of quantum mechanics, namely Schrédinger’s equation, is 
itself symmetrical under time reversal, provided that the wave function ¥ is 
also replaced by Y*. This means that, if at some instant t= ft, the wave 
function Y = Y(t) is given, and if according to Schrédinger’s equation 
it should become Y (tg) at some other instant fa, then the change from P(t1) 
to Y(te) is reversible; in other words, if Y = P"(te) at the initial instant 11, 
then YV = Y"(t;) at to. 

However, despite this symmetry, quantum mechanics does in fact involve 
an important non-equivalence of the two directions of time. This appears 
in connection with the interaction of a quantum object with a system which 
with sufficient accuracy obeys the laws of classical mechanics, a process 
of fundamental significance in quantum mechanics. If two interactions A and 
B with a given quantum object occur in succession, then the statement that 
the probability of any particular result of process B is determined by the 
result of process A can be valid only if process A occurred earlier than pro- 
cess B; see also Quantum Mechanics, § 7. 

Thus in quantum mechanics there is a physical non-equivalence of the 
two directions of time, and theoretically the law of increase of entropy might 
be its macroscopic expression. In that case, there must exist an inequality 
involving the quantum constant # which ensures the validity of this law and 
is satisfied in the real world. Up to the present, however, no such relation 
has been at all convincingly shown to exist. 
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The question of the physical foundations of the law of monotonic increase 
of entropy thus remains open: it may be of cosmological origin and related 
to the general problem of initial conditions in cosmology; the violation 
of symmetry under time reversal in some weak interactions between ele- 
mentary particles may play some part. The answers to such questions may 
be achieved only in the course of further synthesis of physical theories. 

Summarising, we may repeat the general formulation of the law of increase 
of entropy: in all closed systems which occur in Nature, the entropy never 
decreases; it increases, or at least remains constant. In accordance with these 
two possibilities, all processes involving macroscopic bodies are customarily 
divided into irreversible and reversible processes. The former comprise 
those which are accompanied by an increase of entropy of the whole closed 
system; the reverse processes cannot occur, since the entropy would then 
have to decrease. Reversible processes are those in which the entropy of the 
closed system remains constant,‘ and which can therefore take place in the 
reverse direction. A strictly reversible process is, of course, an ideal limit- 
ing case; processes actually occurring in Nature can be reversible only to 
within a certain degree of approximation. 


t It must be emphasised that the entropies of the individual parts of the system 
need not remain constant also. 
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CHAPTER II 


THERMODYNAMIC QUANTITIES 


§ 9. Temperature 


Thermodynamic physical quantities are those which describe macroscopic 
states of bodies. They include some which have both a thermodynamic and a 
purely mechanical significance, such as energy and volume. There are also, 
however, quantities of another kind, which appear as a result of purely 
statistical laws and have no meaning when applied to non-macroscopic 
systems, for example entropy. 

In what follows we shall define a number of relations between thermody- 
namic quantities which hold good whatever the particular bodies to which 
these quantities relate. These are called thermodynamic relations. 

When thermodynamic quantities are discussed, the negligible fluctuations 
to which they are subject are usually of no interest. Accordingly, we shall 
entirely ignore such fluctuations, and regard the thermodynamic quantities 
as varying only with the macroscopic state of the body.' 

Let us consider two bodies in thermal equilibrium with each other, forming 
a closed system. Then the entropy S of this system has its maximum value 
(for a given energy E of the system). The energy E is the sum of the energies 
FE; and EF, of the two bodies: E == E,+ 2. The same applies to the entropy S 
of the system, and the entropy of each body is a function of its energy: 
S = Sy(E£1) + S2(F2). Since Ey = E-E,, E being a constant, S$ is really a 
function of one independent variable, and the necessary condition for a 
maximum may be written 


dE, ~ dE, dE, dy 
dS, dSa _ 
dE, dE, ”’ 


whence 
dS;/dE, = dS2/dE2. 


t Fluctuations of thermodynamic quantities will be discussed in a separate 
chapter (Chapter XIJ). 
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This conclusion can easily be generalised to any number of bodies in equi- 
librium with one another. 

Thus, if a system is in a state of thermodynamic equilibrium, the deriva- 
tive of the entropy with respect to the energy is the same for every part of it, 
i.e. is constant throughout the system. A quantity which is the reciprocal 
of the derivative of the entropy S of a body with respect to its energy E is 
called the absolute temperature T (or simply the temperature) of the body: 


dS/dE = 1/T. (9.1) 


The temperatures of bodies in equilibrium with one another are therefore 
equal: 7, = T>. 

Like the entropy, the temperature is seen to be a purely statistical quantity, 
which has meaning only for macroscopic bodies. 

Let us next consider two bodies forming a closed system but not in equi- 
librium with each other. Their temperatures 7, and T2 are then different. 
In the course of time, equilibrium will be established between the bodies, 
and their temperatures will gradually become equal. During this process, 
their total entropy S = S,+S2 must increase, i.e. its time derivative is 
positive: 


cs est foes 
dt ar 
= inGEy dss GEG. 


dE, drt dE, dt 


Since the total energy is conserved, d£;/d¢+dE2/dt = 0, and so 


isfy (ce LL We a 
a7 (ae; az.) ar 7 (7; 7) dt = 


Let the temperature of the second body be greater than that of the first 
(T2 > Ti). Then dE;/dt > 0, and d£2/dt < 0. In other words, the energy 
of the second body decreases and that of the first increases. This property 
of the temperature may be formulated as follows: energy passes from bodies 
at higher temperature to bodies at lower temperature. 

The entropy S is a dimensionless quantity. The definition (9.1) therefore 
shows that the temperature has the dimensions of energy, and so can be 
measured in energy units, for example ergs. In ordinary circumstances, 
however, the erg is too large a quantity, and in practice the temperature is 
customarily measured in its own units, called degrees Kelvin or simply 
degrees. The conversion factor between ergs and degrees, i. e. the number 
of ergs per degree, is called Boltzmann’s constant and is usually denoted by 
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k; its value ist 
k = 1.38X 10-16 erg/deg. 


In all subsequent formulae the temperature will be assumed measured in 
energy units. To convert to the temperature measured in degrees, in numer- 
ical calculations, we need only replace 7 by kT. The continual use of the 
factor k, whose only purpose is to indicate the conventional units of temper- 
ature measurement, would merely complicate the formulae. 

If the temperature is in degrees, the factor k is usually included in the 
definition of entropy: 


S= klogAl, (9.2) 


instead of (7.7), in order to avoid the appearance of k in the general relations 
of thermodynamics. Then formula (9.1) defining the temperature, and there- 
fore all the general thermodynamic relations derived subsequently in this 
chapter, are unaffected by the change to degrees. 

Thus the rule for conversion to degrees is to substitute in all formulae 


T~>kT, S > Sik. (9.3) 


§ 10. Macroscopic motion 


As distinct from the microscopic motion of molecules, the macroscopic 
motion is that in which the various macroscopic parts of a body participate 
as a whole. Let us consider the possibility of macroscopic motion in a state 
of thermodynamic equilibrium. 

Let the body be divided into a large number of small (but macroscopic) 
parts, and let M,, E, and P, denote the mass, energy and momentum of the 
ath part. The entropy S, of each part is a function of its internal energy, i.e. 
the difference between its total energy E, and the kinetic energy P2/2M, of 
its macroscopic motion.? The total entropy of the body can therefore be 
written 


S = Y S,(E,—P?/2Mz). (10.1) 


t For reference, we may also give the conversion factor between degrees and 
electron-volts: 
1eV = 11,606 deg. 


+ The fact that the entropy of a body is a function only of its internal energy 
follows at once from Galileo’s relativity principle; the number of quantum states, 
and therefore the statistical weight (whose logarithm is the entropy), must be the 
same in all inertial frames of reference, and in particular that in which the body is 
at rest. 
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We shall assume that the body is a closed system. Then its total momentum 
and angular momentum are conserved, as well as its energy: 


)P, = constant, )'r,P, = constant, (10.2) 


where r, is the radius vector of the ath part. In a state of equilibrium, the 
total entropy S of the body as a function of the momenta P,, has a maximum 
subject to the conditions (10.2). Using the familiar Lagrange’s method of 
undetermined multipliers, we find the necessary conditions for a maximum 
by equating to zero the derivatives with respect to P, of the sum 


> {S,+a-P,+b-r.XP,}, (10.3) 


where a and b are constant vectors. Differentiation of S, with respect to P, 
gives,’ by the definition of the temperature, 


ro) P P, Va 
ap, 5¢(#-—397,) "MP ~~ T° 


where v, = P/M, is the velocity of the ath part of the body. Differentiation 
of (10.3) therefore gives 


—v,/T+a+bxXr, = 0, 


or 
Vg = u+2xXr,, (10.4) 


where u = 7a and $2 = Tb are constant vectors. 

This result has a simple physical significance. If the velocities of all the 
parts of a body are given by formula (10.4) with the same u and Q, this 
means that we have a translational motion of the body as a whole with con- 
stant velocity u and arotation of the body as a whole with constant angular 
velocity 2. Thus we arrive at the important result that in thermodynamic 
equilibrium a closed system can execute only a uniform translational and 
rotational motion as a whole. No internal macroscopic motion is possible 
in a state of equilibrium.? 


t The derivative with respect to a vector is to be understood as another vector 
whose components are equal to the derivatives with respect to the components. 
of the first vector. 

t To avoid misunderstanding, an important exception to this rule should be 
noted: superfluid liquid helium cannot rotate as a whole. This effect will be discussed 
elsewhere in the Course of Theoretical Physics. Here we shall only mention that 
the proof given above is invalid in that case, because the velocity distribution is 
subject to a further condition (the superfluid flow is a potential flow) and the en- 
tropy maximum is to be sought under this condition. 
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In what follows we shall usually consider bodies at rest, and the energy 
E will accordingly be the internal energy of the body. . 

So far we have made use only of the necessary condition for a maximum 
of entropy as a function of the momenta, but not of the sufficient condition 
to be imposed on the second derivatives. It is easy to see that the latter 
condition leads to the very important result that the temperature must be 
positive: J > 0.' To deduce this, it is not in fact necessary to calculate the 
second derivatives; instead, we can argue as follows. 

Let us consider a body forming a closed system, at rest as a whole. If the 
temperature were negative, the entropy would increase with decreasing argu- 
ment. Since the entropy tends to increase, the body would spontaneously 
seek to break up into dispersing parts (with total momentum 2P, = 0), 
so that the argument of each S, in the sum (10.1) should take its least possible 
value. In other words, bodies in equilibrium could not exist with 7 < 0. 

The following point should be noted, however. Although the temperature 
of a body or any part of it can never be negative, there may exist partial 
equilibria in which the temperature corresponding to a particular group of 
degrees of freedom of the body is negative. This is further discussed in 
§ 73. 


§ 11. Adiabatic processes 


Among the various kinds of external interactions to which a body is subject, 
those which consist in a change in the external conditions form a special 
group. By “external conditions” we mean in a wide sense various external 
fields. In practice the external conditions are most often determined by the 
fact that the body must have a prescribed volume. In one sense this case may 
also be regarded as a particular type of external field, since the walls which 
limit the volume are equivalent in effect to a potential barrier which prevents 
the molecules in the body from escaping. 

If the body is subject to no interactions other than changes in external 
conditions, it is said to be thermally isolated. It must be emphasised that, 
although a thermally isolated body does not interact directly with any other 
bodies, it is not in general a closed system, and its energy may vary with 
time. 

In a purely mechanical way, a thermally isolated body differs from a closed 
system only in that its Hamiltonian (the energy) depends explicitly on the 
time: E = E(p, q, 1), because of the variable external field. If the body also 
interacted directly with other bodies, it would have no Hamiltonian of its 


+ The temperature 7 = 0 (absolute zero) corresponds to —273.15°C. 
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own, since the interaction would depend not only on the coordinates of the 
molecules of the body in question but also on those of the molecules in the 
other bodies. 

This leads to the result that the law of increase of entropy is valid not 
only for closed systems but also for a thermally isolated body, since here 
we regard the external field as a completely specified function of coordinates 
and time, and in particular neglect the reaction of the body on the field. 
That is, the field is a purely mechanical and not a statistical object, whose 
entropy can in this sense be taken as zero. This proves the foregoing state- 
ment. 

Let us suppose that a body is thermally isolated, and is subject to external 
conditions which vary sufficiently slowly. Such a process is said to be adiabat- 
ic. We shall show that, in an adiabatic process, the entropy of the body re- 
mains unchanged, i.e. the process is reversible. 

We shall describe the external conditions by certain parameters which 
are given functions of time. For example, suppose that there is only one 
such parameter, which we denote by 2. The time derivative dS/dt of the 
entropy will depend in some manner on the rate of variation d//dt of the 
parameter 4. Since dA/dt is small, we can expand dS/d¢ in powers of dA/dt. 
The zero-order term in this expansion, which does not involve dA/d?, is zero, 
since if dA/d¢ = 0 then dS/dt = 0 also, because the entropy of a closed 
system in thermodynamic equilibrium must remain constant under constant 
external conditions. The first-order term, which is proportional to d//dt, 
must also be zero, since this term changes sign with dA/dt, whereas dS/d¢ 
is always positive, according to the law of increase of entropy. Hence it 
follows that the expansion of dS/df begins with the second-order term, 
i.e. for small d//dt we have 


dS/dt = A(da/dz), 
or 
dS/da = A da/dt. 


Thus, when dA/df tends to zero, so does dS/d/, which proves that the adia- 
batic process is reversible. 

It must be emphasised that, although an adiabatic process is reversible, 
not every reversible process is adiabatic. The condition for a process to be 
reversible requires only that the total entropy of the whole of a closed 
system be constant, while the entropies of its individual parts may either 
increase or decrease. In an adiabatic process, a stronger condition holds: 
the entropy of a body which is only a part of a closed system also remains 
constant. 

We have defined an adiabatic process as one which is sufficiently slow. 
More precisely, we can say that the external conditions must change so 


40 Thermodynamic Quantities 


slowly that at any instant the body may be regarded as being in a state of 
equilibrium corresponding to the prevailing external conditions. That is, 
the process must be slow in comparison with the processes leading to the 
establishment of equilibrium in the body concerned.t 

We may derive a formula to calculate by a purely thermodynamic method 
various mean values. To do so, we assume that a body undergoes an adiabat- 
ic process, and determine the time derivative dE/dt of its energy. By defi- 
nition, the thermodynamic energy is 


E = E(p, q; A), 


where E(p, q; 4) is the Hamiltonian of the body, depending on A as a para- 
meter. We know from mechanics that the total time derivative of the 
Hamiltonian is equal to its partial time derivative (see Mechanics, § 40): 


dE(p,934) _ OE(p, 93 4) 
dt = ot ; 


In the present case E(p, g; 4) depends explicitly on the time through A(/), and 
we can therefore write 


dE(p, 9; 4) _ OE(p, 9g; 4) da 
dt = oA dt* 


Since the operations of averaging over the statistical distribution and differ- 
entiating with respect to time can clearly be interchanged, we have 


dE _ dE(p,q; 4) _ dE(p, 934) da | (11.1) 
di dt oA dt ’ 


the derivative dA/dt is a given function of time, and can be taken outside the 
averaging. 

It is very important that, since the process is adiabatic, the mean value of 
the derivative GE(p, q; A)/OA in (11.1) can be taken as the mean value over 


t In practice this may be a very weak condition, so that the “slow” adiabatic 
process may be quite a “fast” one. For example, in the expansion of a gas, say in 
a cylinder with a piston moving outwards, the speed of the piston need be small 
only compared with the velocity of sound in the gas, i.e. it may in practice be very 
large. 

In general textbooks on physics an adiabatic expansion (or compression) is often 
defined as one which is “sufficiently rapid”. This refers to a different aspect of the 
problem: the process must occur so rapidly that the body cannot exchange heat 
with the surrounding medium. Thus the condition in question is one which will 
in practice ensure that the body is thermally isolated, and the condition of slowness 
compared with processes leading to the establishment of equilibrium is tacitly 
assumed satisfied. 
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the statistical distribution corresponding to equilibrium for a given value of 
the parameter A, i.e. for the external conditions prevailing at a given instant. 

The derivative dE/dt can also be written in another form by regarding the 
thermodynamic quantity FE as a function of the entropy S of the body and 
the external parameters 7. Since, in an adiabatic process, the entropy S re- 
mains constant, we have 


(11.2) 


dE /(dE\ da 
aw (a) 


7”) “1? 
dr. g it 
where the subscript to the parenthesis indicates that the derivative is taken 


for constant S. 
Comparison of (11.1) and (11.2) shows that 


OE(p,q3 4) _ () 

— OA «NA OATS” 
This is the required formula. It enables us to calculate thermodynamically 
the mean values (over the equilibrium statistical distribution) of quantities 
of the form 0E(p, q; 2)/0A. Such quantities are continually encountered when 
studying the properties of macroscopic bodies, and in consequence formula 
(11.3) is of great importance in statistical physics. It appears in the calcula- 
tion of various forces acting on a body (the parameters 7 being the coordi- 
nates of a particular part of the body; see§ 12), the calculation of the magnet- 
ic or electric moment of bodies (the parameters 4 being the magnetic or 
electric field strengths), and so on. 

The arguments given here for classical mechanics are entirely applicable 
to the quantum theory, except that the energy E(p, q; 2) must be everywhere 
replaced by the Hamiltonian operator A. Then formula (11.3) becomes 

on (22) 
eA \AA7s’ 


(11.3) 


(11.4) 


the bar denoting complete statistical averaging (which automatically in- 
cludes the quantum averaging). 


§ 12. Pressure 


The energy E of a body, as a thermodynamic quantity, has the property of 
being additive: the energy of the body is equal to the sum of the energies of 
its individual (macroscopic) parts.’ Another fundamental thermodynamic 
quantity, the entropy, also has this property. 


t In so far as we neglect the energy of interaction of these parts; this is not per- 
missible if we are interested in effects arising from the presence of interfaces between 
different bodies. Chapter XV deals with this topic. 
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The additivity of the energy and the entropy leads to the following im- 
portant result. If a body is in thermal equilibrium, we can say that, for a 
given energy, the entropy depends only on the volume of the body, and not 
on its shape; the same is true of the energy for a given entropy.’ For a 
change in the shape of the body can be regarded as a rearrangement of its 
individual parts, and so the entropy and energy, being additive, will remain 
unchanged. Here, of course, it is assumed that the body is not in an external 
field of force, so that the motion of the parts of the body in space does not 
involve a change in their energy. 

Thus the macroscopic state of a body at rest in equilibrium is entirely 
determined by only two quantities, for example the volume and the energy. 
All other thermodynamic quantities can be expressed as functions of these 
two. Of course, because of this mutual dependence of the various thermody- 
namic quantities, any other pair could be regarded as the independent 
variables. 

Let us now calculate the force exerted by a body on the surface bounding 
its volume. According to the formulae of mechanics, the force acting on a 


surface element ds is 
F = —0E(p, q; r)/or, 


where E(p, q; r) is the energy of the body as a function of the coordinates 
and momenta of its particles and of the radius vector of the surface element 
considered, which here acts as an external parameter. Averaging this equa- 
tion and using formula (11.3), we obtain 


Poo Sew) (2) " -(#) ov 
or/s  \WV/s5 or’ 
where V is the volume. Since the change in volume is ds-dr, we have OV /Or = 


ds, the surface element, and so 
F = —(dE/OV)s ds. 


Hence we see that the mean force on a surface element i$ normal to the ele- 
ment and proportional to its area (Pascal’s law). The magnitude of the 


force per unit area is 
P=-—(0E/OV)s. (12.1) 


This quantity is called the pressure. 


t These statements are applicable in practice to liquids and gases but not to 
solids. A change in shape (deformation) of a solid involves the doing of work, so 
that the energy of the body is changed. This is because the deformed state of the 
solid is, strictly speaking, an incomplete thermodynamic equilibrium (but the 
relaxation time for the establishment of complete equilibrium is so long that in 
many respects the deformed body behaves as if in equilibrium). 
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In defining the temperature by formula (9.1) we were essentially consider- 
ing a body which is not in direct contact with any other bodies, and in 
particular is not surrounded by any external medium. Under these conditions 
it was possible to speak of the change in energy and entropy of the body 
without making more specific the nature of the process. In the general case 
of a body in an external medium, or surrounded by the walls of a vessel, 
formula (9.1) must be made more precise. For if during the process the 
volume of the body changes, this will necessarily affect the state of the bodies 
in contact with it, and in order to define the temperature we should have to 
take into consideration at the same time all the bodies in contact (for exam- 
ple, both the body in question and the vessel containing it). If it is desired to 
define the temperature in terms of thermodynamic quantities for the given 
body only, its volume must be regarded as constant. In other words, the 
temperature is defined as the derivative of the energy of the body with respect 
to its entropy, taken at constant volume: 


T = (BE/OS\y. (12.2) 


The equations (12.1), (12.2) can also be written together as a relation 
between differentials: — 


dE = TdS—Pdr. (12.3) 


This is one of the most important relations in thermodynamics. 

The pressures of bodies in equilibrium with one another are equal. This 
follows immediately from the fact that thermal equilibrium necessarily 
presupposes mechanical equilibrium; in other words, the forces exerted 
on each other by any two of these bodies at their surface of contact must be 
equal in magnitude and opposite in direction, and thus balance. 

The equality of pressures in equilibrium can also be derived from the 
condition of maximum entropy, in the same way as the equality of tempera- 
tures was shown in § 9. To do this, we consider two parts, in contact, of a 
closed system in equilibrium. One necessary condition for the entropy to be 
a maximum is that it should be a maximum with respect to a change in the 
volumes V; and Vs of these two parts when the states of the other parts 
undergo no change(this means, in particular, that V; + V_ remains constant). 
If the entropies of the two parts are S; and Ss, we have 

oS OS: OSs Oe O89; Se 


Wi Vit Ws WW We =” 


From the relation (12.3) in the form 


1 


P 
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it is seen that 0S/OV = P/T, and so P1/T; = P2/T2. Since the temperatures 
T1 and T2 are the same in equilibrium, we therefore find that the pressures 
are equal, P; = Po. - 

It must be remembered that, when thermal! equilibrium is established, the 
equality of pressures (i.e. mechanical equilibrium) is reached much more 
rapidly than that of temperatures, and so cases are often met with in which 
the pressure is constant throughout a body but the temperature is not. The 
reason is that the non-constancy of pressure is due to the presence of uncom- 
pensated forces; these bring about macroscopic motion so as to equalise 
the pressure much more rapidly than the equalisation of temperature, which 
does not involve macroscopic motion. 

It is easy to see that the pressure must be positive in any equilibrium state: 
when P > 0 we have (6S/6V), => 0, and the entropy could increase only by 
an expansion of the body, which is prevented by the surrounding bodies. 
If P < 0, however, then we should have (6S/6V),; < 0, and the body would 
spontaneously contract so as to increase its entropy. 

There is, however, an important difference between the requirements of 
positive temperature and positive pressure. Bodies of negative temperature 
would be completely unstable and cannot exist in Nature. States (non- 
equilibrium) of negative pressure can exist in Nature with restricted stability. 
The reason is that the spontaneous contraction of the body involves “de- 
taching” it from the walls of the vessel or the formation of cavities within 
it, that is, the formation of a new surface, and this leads to the possibility 
of the existence of negative pressures in what are called metastable states." 


§ 13. Work and quantity of heat 


The external forces applied to a body can do work on it, which is deter- 
mined, according to the general rules of mechanics, by the products of these 
forces and the displacements which they cause. This work may serve to 
bring the body into a state of macroscopic motion (or in general to change 
its kinetic energy), or to move the body in an external field (for instance, to 
raise it against gravity). We shall, however, be mainly interested in cases 
where the volume of a body is changed as a result of work done on it (i.e. 
the external forces compress the body but leave it at rest as a whole). 

We shall everywhere regard as positive an amount of work R done on a 
given body by external forces. Negative work (R < 0) will correspondingly 
mean that the body itself does work (equal to | R|) on some external objects 
(for example, in expanding). 


t These are defined in § 21. Negative pressures are further discussed in the foot- 
note following (83.1). 
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Bearing in mind that the force per unit area of the surface of the body is. 
the pressure, and that the product of the area of a surface element and its 
displacement is the volume swept out by it, we find that the work done on 
the body per unit time when its volume changes is 


dR/dt = —P dV dt; (13.1) 


in compression, dV’/dt < 0, so that dR/dt > 0. This formula is applicable 
to both reversible and irreversible processes; only one condition need be 
satisfied, namely that throughout the process the body must be in a state of 
mechanical equilibrium, i.e. at each instant the pressure must be constant 
throughout the body. 

If the body is thermally isolated, the whole of the change in its energy is 
due to the work done on it. In the general case of a body not thermally iso- 
lated, in addition to the work done, the body gains or loses energy by direct 
transfer from or to other bodies in contact with it. This part of the change in 
energy is called the quantity of heat Q gained or lost by the body. Thus the 
change in the energy of the body per unit time may be written 


dE dR. dQ 
ae aE + ar (13.2) 
Like the work, the heat will be regarded as positive if gained by the body 
from external sources. 

The energy £ in (13.2) must, in general, be understood as the total energy 
of the body, including the kinetic energy of its macroscopic motion. We 
shall, however, usually consider the work corresponding to the change in 
volume of a body at rest, in which case the energy reduces to the internal 
energy of the body. 

Under conditions where the work is defined by formula (13.1), we have for 
the quantity of heat 

dQ dE dV 
“ar = Tips er Vas (13.3) 

Let us assume that at every instant throughout the process the body may be 
regarded as being in a state of thermal equilibrium corresponding to its 
energy and volume at that instant; it must be emphasised that this does not 
mean that the process is necessarily reversible, since the body may not be in 
equilibrium with surrounding bodies. Then, from the relation (12.3), which 
gives the differential of the function E(S, V), the energy of the body in the 
equilibrium state, we can put 

dE dS dV 


a ar ae 
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Comparison with (13.3) shows that 
dQ/dt = 7 dS/dt. (13.4) 


The work dR and the quantity of heat dQ gained by the body in an infini- 
tesimal change of state are not the total differentials of any quantities. Only 
the sum dQ + dR, i.e. the change in energy dE, is a total differential. We can 
therefore speak of the energy £ in a given state, but not, for example, of the 
quantity of heat which a body possesses in a given state. In other words, the 
energy of the body cannot be divided into thermal and mechanical parts; this _ 
is possible only when considering the change in energy. The change in energy 
when a body goes from one state to another can be divided into the quantity 
of heat gained or lost by the body and the work done on it or by it. This 
division is not uniquely determined by the initial and final states of the body, 
but depends also on the nature of the process itself. That is, the work and the 
quantity of heat are functions of the process undergone by the body and 
not only of its initial and final states. This is seen particularly when the body 
undergoes a cyclic process, starting and finishing in the same state. The 
change in energy is then zero, but the body may gain or lose a quantity of 
heat or work. Mathematically this corresponds to the fact that the integral 
of the total differential dE around a closed circuit is zero, but the integral 
of dQ or dR, which are not total differentials, is not zero. 

The quantity of heat which must be gained in order to raise the tempera- 
ture of the body by one unit is called its specific heat. This clearly depends 
on the conditions under which the heating takes place. A distinction is usu- 
ally made between the specific heat at constant volume C,, and that at con- 
Stant pressure C,. Clearly 


Cy = T(BS/OT)y, (13.5) 
C, = T(dS/eT)p. (13.6) 


Let us consider cases where formula (13.4) for the quantity of heat is in- 
applicable, but at the same time it is possible to establish certain inequalities 
for this quantity. There exist processes in which the body is not in thermal 
equilibrium although the temperature (and pressure) are constant throughout 
the body; for example, chemical reactions in a homogeneous mixture of 
reactants. Owing to the irreversible process (the chemical reaction) occurring 
in the body, its entropy increases independently of the heat gained, and so 
we can say that the inequality 

dQ/dt < T dS/dt (13.7) 
holds. 


t In this sense the notation dR and dQ is not quite precise, and we therefore 
avoid it as far as possible. 


§ 14 The Heat Function 47 


Another case where a similar inequality can be stated is an irreversible 
process in which the body goes from one equilibrium state to another 
neighbouring one but is not in equilibrium during the process.‘ Then the 
inequality 

6Q <TOdS (13.8) 
holds between the quantity of heat 6Q gained by the body in this process 
and its entropy change 6S. 


§ 14. The heat function 


If the volume of a body remains constant during a process, then dQ = dE, 
i.e. the quantity of heat gained by the body is equal to the change in its 
energy. If the process occurs at constant pressure, the quantity of heat can 
be written as the differential 


dQ = d(E+PV) = dW (14.1) 


of a quantity 
W = E+PV, (14.2) 


called the heat function of the body.+ Thechange in the heat function in pro- 
cesses occurring at constant pressure is therefore equal to the quantity of 
heat gained by the body. 

It is easy to find an expression for the total differential of the heat function. 
Putting dE = TdS—PdV and dW = dE+PdV+YV dP, we have 


dW = TdS+V dP. (14,3) 
From this it follows that 


T= (8W/OS)p, V =(OW/OP)s. (14.4) 


If the body is thermally isolated (which, it will be remembered, does not 
imply that it is a closed system), dQ = 0, and (14.1) shows that, in processes 
occurring at constant pressure and involving a thermally isolated body, 


W = constant, (14.5) 


i.e. the heat function is conserved. 
The specific heat C, can be written, using the relation dE = TdS—P dV, 


as 
Cy = (BE/OT)p. (14.6) 


+ An example is the Joule-Thomson process (see§ 18) with a small change in 
pressure. 
t Also called the enthalpy or heat content. 
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Similarly, we have for the specific heat C, 
C, = (OW/eT)p. 


We see that at constant pressure the heat function has properties similar 
to those of the energy at constant volume. 


§ 15. The free energy and the thermodynamic potential 


The work done on a body in an infinitesimal isothermal reversible change of 
state can be written as a differential: 


dR = dE—dQ = dE-TdS 
= d(E—-TS) 
or 
dR = dF, (15.1) 
where 
F = E-TS (15.2) 


is another function of the state of the body, called the free energy. Thus the 
work done on the body in a reversible isothermal process is equal to the 
change in its free energy. 

Let us find the differential of the free energy. Substituting dE = T7dS— 
P dV and dF = dE—T dS—S dT, we have 


dF =—S dT—P dv. (15.3) 


Hence it is evident that 
S =—(0F/0T)p, = —(dF/dV)r. (15.4) 


Using the relation E = F+T7S, we can express the energy in terms of the 
free energy as 
E = F-T(6F/0T)y 


0 6F 
= Tar r),- (15.5) 

Formulae (12.1), (12.2), (14.4) and (15.4) show that, if we know any of the 
quantities L, W and F as a function of the corresponding two variables and 
take its partial derivatives, we can determine all the remaining thermody- 
namic quantities. For this reason E, W and F are called thermodynamic poten- 
tials (by analogy with the mechanical potential) or characteristic functions: 
the energy E£ with respect to the variables S, V; the heat function W with 
respect to S, P; the free energy F with respect to V, T. 
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We still lack a thermodynamic potential with respect to the variables P, T 
To derive this we substitute in (15.3) PdV = d(PV)—V dP, take d(PV) to 
the left-hand side of the equation, and obtain 

d® =—SdT+V dP, (15.6) 
with a new quantity 
® = E-TS+PV 
= F+PV 
= W—TS, (15.7) 


called the thermodynamic potential (in a restricted sense of the term).* 
From (15.6) we clearly have 


S=—(0O/eT)p, V = (0O/oP)r. (15.8) 
The heat function is expressed in terms of @ in the same way as £ in terms 
of F: 
W = @-T(0®/0T)p 


= -1"(er 7), (15.9) 


If there are other parameters A, besides the volume which define the state 
of the system, the expression for the differential of the energy must be aug- 
mented by terms proportional to the differentials dA,: 


dE = TAS—PaV +) Ar dh, (15.10) 


where the A, are some functions of the state of the body. Since the transfor- 
mation to other potentials does not affect the variables 7,, it is clear that 
similar terms will be added to the differentials of F, ®, W: 


dF = —SaT—PaV + yA, dai, 


etc. Hence the quantities 4, can be obtained by differentiation with respect to 
A, of any of these potentials (it must be remembered which other’ variables 
are treated as constant in the differentiation). Using also formula (11.3), we 
can write down the analogous relation 


OE(p, 934) _ (7) 
a bay (15.11) 


t'In Western literature, the functions F and ® are often called respectively the 
Helmholtz free energy and the Gibbs free energy. 
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which expresses the mean value of the derivative of the Hamiltonian with 
respect to any parameter as the derivative of the free energy with respect to 
that parameter (and similar relations involving the derivatives of ® and W). 

The following point may be noted. If the values of the parameters /, 
change slightly, the quantities E, F, W and ® will also undergo small changes. 
It is evident that these changes will be equal if each is considered for the 
appropriate pair of constant quantities: 


(dE)s, vr (6F)z,r = (OW)s, p = (6®)rz p. (15.12) 


This statement, which we call the theorem of small increments, will be 
used several times below. 

The free energy and the thermodynamic potential have an important 
property which determines the direction in which they change in various 
irreversible processes. From the inequality (13.7), substituting dQ/dt from 


(13.3), we obtain 
dE dV dS 
Let us assume that the process is isothermal. and occurs at constant volume 
(T = constant, V = constant). Then this inequality may be written 
d(E-TS) dF 

Thus irreversible processes occurring at constant temperature and constant 
volume are accompanied by a decrease in the free energy of the body. 

Similarly, for P = constant and T = constant the inequality (15.13) be- 


comes 
d@/dt < 0; (15.15) 


that is, irreversible processes occurring at constant temperature and constant 
pressure are accompanied by a decrease in the thermodynamic potential.* 

Correspondingly, in a state of thermal equilibrium the free energy and the 
thermodynamic potential have minimum values, the former with respect to 
all changes of state with T and V constant, and the latter with respect to 
changes of state with T and P constant. 


PROBLEM 


How can the mean kinetic energy of the particles in a body be calculated if the 
formula for its free energy is known? 


t It should be remembered that in both cases the processes in question are those 
(such as chemical reactions) for which the body is not in equilibrium, so that its 
state is not uniquely defined by the temperature and the volume (or pressure). 
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SOLUTION. The Hamiltonian function (or, in the quantum case, the Hamilto- 
nian operator) may be written in the form E(p, gq) = U(q)+K(p), where U(q) is the 
potential energy of interaction of the particles in the body, and K(p) their kinetic 
energy. The latter is a quadratic function of the momenta, inversely proportional 
to the particle mass m (for a body consisting of identical particles). Regarding m as 
a parameter, we can therefore write 


OE(p,q;m) st 
“Soe 


Then, applying formula (15.11), we obtain the mean kinetic energy K = K(p): 
K= —m(0F, / Om)r, ye 


§ 16. Relations between the derivatives of thermodynamic quantities 


In practice the most convenient, and the most widely used, pairs of ther- 
modynamic variables are T, V and T, P. It is therefore necessary to transform 
various derivatives of the thermodynamic quantities with respect to one 
another to different variables, both dependent and independent. 

If V and T are used as independent variables, the results of the transforma- 
tion can be conveniently expressed in terms of the pressure P and the specific 
heat C, (as functions of V and T). The equation which relates the pressure, 
volume and temperature is called the equation of state for a given body. 
Thus the purpose of the formulae in this case is to make it possible to calcu- 
late various derivatives of thermodynamic quantities from the equation of 
state and the specific heat C,,. 

Similarly, when P and T are taken as the basic variables the results of the 
transformation should be expressed in terms of V and C, (as functions of 
PandT). 

Here it must be remembered that the dependence of C, on V or of C, on P 
(but not on the temperature) can itself be determined from the equation of 
state. It is easily seen that the derivative (6C,,/OV), can be transformed so that 
it is defined in terms of the function P(V,T). Using the fact that S = 
—(6F/68T),, we have 


Cr) _ op OS _ op OF 
(ar), Vor ovor 
Oo? /OF 
=-T 37 (5p) > 


and since (OF/6V), = —P, we have the required formula 


(8C,/8V) 7 = T(82P/OT)p.. (16.1) 
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Similarly we find 
(0C,/0P)r = —T(0*V /0T”)p, (16.2) 


formulae (15.8) being used in the calculation. 

We shall show how some of the thermodynamic derivatives most often 
encountered may be transformed. 

The derivatives of the entropy with respect to volume or pressure can be 
calculated from the equation of state by means of the following formulae, 
which are a direct consequence of the expressions for the differentials of ine 
thermodynamic quantities. We have 


a8) __ 0 (ary __ 8 (ar 
(a7), --av (ar), ar (ar), 


or 
(85/aV)r = (OP/aT)y. (16.3) 
Similarly 
os ao /(e® 0 /o® 
(a), ~~ ar (ar), ~~ ar (ze), 
or : 
(OS/OP)r = —(6V/OT)p. (16.4) 


The derivative (OE/OV), is calculated from the equation 
dE = TdS—P dv 
as 


or, substituting (16.3), 


OE oP 
(a). = (Sr) aed (16.5) 
Similarly we can derive 


(),--(@)-", a 
(),- 8) (8) (BF), 1-2) 0 
(B),-o-r), (BF) ar ,-o 


or or 
Finally, we shall show how the specific heat C, may be calculated from 
the specific heat C, and the equation of state, using T and P as the basic 
variables. Since C, = T(0S/OT),, we have to transform the derivative 
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(8S/8T), to different independent variables. A transformation of this type 
is most simply effected by the use of Jacobians.t We write 
C, = T(0S/aT)v 
= Tas, V)/a(T, V) 

a(S, V)/a(T, P) 
Oo, VOT, P) 
_ pOS/6T)p (OV /OP)r—(OS/OP)r (EV OL )p 

(OV /OP)r 
(OS/OP)r (OV /OT)p 

(OV /OP)r 


=T 


Substituting (16.4), we obtain the required formula: 
Cp—Cy = —T[(OV/0T) (OV /OP)r. (16.9) 


Similarly, transforming C, = T(0S/0T)p to the variables T, V, we can 
derive the formula 


C,—Cy = —T[(OP/8T)y]}2/(OP/8V )r. (16.10) 


The derivative (@P/dV), is negative: in an isothermal expansion of a body, 
its pressure always decreases. This will be rigorously proved in § 21. It 
therefore follows from (16.10) that for all bodies 


Ci Cx (16.11) 


* The Jacobian O(u, v)/O(x, y) is defined as the determinant 
O(u,v) Ou/ox Ou/dy | 


O(x, ») (Ov/Ox Ov/oy 3 @ 
It clearly has the following properties: 
O(v,u) «Ou, v) 
Oxy) (x, »)’ up) 
Ou, y) _ (Ou 
Oxy) (&:) ” 
The following relations also hold: 
Ou, v) Otu,v) O(t, 5) 
SS i Sid ta 2a Ah Pp eS IV 
BGe.9) ~ 8G, 9) 8G,9)’ ~~ 
qd Ou, v) O(du/dt, v) ou, dv/dt) (v) 
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In adiabatic expansion (or contraction) of a body its entropy remains 
constant. The relation between the temperature, volume and pressure of the 
body in an adiabatic process is therefore determined by various derivatives 
taken at constant entropy. We shall derive formulae whereby these derivatives 
may be calculated from the equation of state of the body and its specific 
heat. 

For the derivative of the temperature with respect to volume we have, 
changing to independent variables V, T, 


(=) _ AT,S) _ AT, S)/aV,T) 


VW), B.S) OV, S\/aV,T) 


_ _ (0S/oV)r 
~ (AS/OT)y 


__ 7 as 
--a (a7), 


or, substituting (16.3), 


ar T (oP 

(er). --= (sr) - (16.12) 
Similarly we find 

ar\  T [av 

sa pas uae (16.13) 

(aP),-c (e), 


These formulae show that, according as the thermal expansion coefficient 
(OV /dT)p is positive or negative, the temperature of the body falls or rises in 
an adiabatic expansion.' 

Let us next calculate the adiabatic compressibility (@V/OP), of the body, 
writing 

(5) OV, S)_ aV, S/OV,T) OV,T) _ (S/eT)y (=) 

oP 


we renee nn a eet ee, —— eo S ceenaammendl 


OP), 
or 


ov Cy (%) 
OLE Ve cs SOY (16.14) 
(ar), C, \OP),, 


The inequality C, > C,, therefore implies that the adiabatic compressibility 
is always smaller in absolute value than the isothermal compressibility. 


T In § 21 it will be shown rigorously that C, is always positive, and therefore so 
is C,. 
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Using formulae (16.9) and (16.10), we can derive from (16.14) the relations 


(aP),~ (se),*z [(er) 


7, (\-E (GT mm 


§ 17. The thermodynamic scale of temperature 


2 
| (16.15) 


We shall show how a thermodynamic scale of temperature may be construct- 
ed, at least in principle, using for this purpose an arbitrary body whose 
equation of state is not assumed known a priori. The problem is thus to 
establish by means of this body the relation T = T(x) between the absolute 
scale of temperature J and some purely arbitrary scale + defined by an 
arbitrarily calibrated “thermometer”. 

To do this, we start from the following relation (in which all quantities 
refer to the body in question): 


(8Q/OP)r = T(@S/OP)r = —T(OV /AT)p,’ 


where (16.4) has been used. Since z and T are in one-to-one relation, it does 
not matter whether the derivative is written for constant T or constant rt. 
The derivative (OV /dT)p may be written as 


ev) _ (%) a 
(ar) = (Se) ar 

80) __7 (2%) a 
(ap) =- (x) 


dlogT (0V/Or)p 
dx (oQ/aP), 


Then 


or 
(17.1) 


The right-hand side involves quantities which can be measured directly as 
functions of the arbitrary temperature 7: (0Q/0P), is the quantity of heat 
which must be supplied to the body in order to maintain its temperature 
constant during expansion, and the derivative (OV /Or)p is determined by 
the change in volume of the body on heating. Thus formula (17.1) gives the 
solution of the problem and can be used to determine the required relation 
T = T(t). 

Here it must be remembered that the integration of (17.1) determines 
log T only to within an additive constant. The temperature T is therefore 


56 Thermodynamic Quantities 


determined only to within an arbitrary constant factor. This is as it should 
be, of course: the choice of the units of measurement of the absolute temper- 
ature remains arbitrary, which is equivalent to the presence of an arbitrary 
factor in the function T = T(z). 


§ 18. The Joule-Thomson process 


Let us consider a process which consists in a gas (or liquid) at pressure P; 
being steadily transferred to a vessel where its pressure is P2. By “steadily” we 
mean that the pressures P, and P, remain constant throughout the process. 
Such a process may be diagrammatically represented as a passage of the gas 
through a porous partition (a in Fig. 2), the constancy of pressure on either 


side of the partition being maintained by pistons moving inward and out- 
ward in an appropriate manner. If the holes in the partition are sufficiently 
small, the macroscopic flow velocity of the gas may be taken as zero. We 
shall also assume that the gas is thermally isolated from the external medium. 

This process is called a Joule-Thomson process. It must be emphasised 
that it is an irreversible process, as may be seen simply from the presence 
of the partition with very small holes, which creates a large amount of fric- 
tion and destroys the velocity of the gas. 

Let a quantity of gas, occupying a volume V; at pressure P,, pass (ther- 
mally isolated) into the volume V2, the pressure becoming equal to P:. The 
change in energy E2—E; of this gas is equal to the work P,V; done on the 
gas to move it out of the volume V1, minus the work P2V2 done by the gas 
in occupying the volume V2 at pressure Po. Thus E2—E; = PyV1—P2V2, or 


E,+PW 1 = Fot+PV2, 
that is, 
Wy = Wy. (18.1) 


Thus the heat function of the gas is conserved in a Joule-Thomson process. 

The change in temperature caused by a small change of pressure in a Joule— 
Thomson process is given by the derivative 67/0P taken with the heat 
function constant. We may transform this derivative to independent vari- 


§ 19 Maximum Work 57 
ables P and7: 


(2). So ees _ @WLP)r 
Ww 


ren Fa 


oP OP,W)  OP,W)OP,T) ~ = (OW eT)’ 


whence, by means of formulae (14.7) and (16.7), we obtain 
or 1 OV 
(),-e Ge); 


The change in entropy is given by the derivative (0S/0P),,. From the rela- 
tion dW = TdS+V dP, written in the form dS = dW/T—V dP/T, we 
have 


(OS/OP)y = —V/T. (18.3) 


This quantity is always negative, as it should be: the change of a gas to a 
lower pressure by an irreversible Joule-Thomson process results in an in- 
crease in entropy. 

We may add a few words concerning a process in which a gas originally 
in one of two communicating vessels expands into the other vessel; this 
process, of course, is not a steady one, the pressures in the two vessels varying 
until they become equal. When a gas expands into a vacuum in this way, its 
energy E is conserved. If, as a result of the expansion, the total volume is 
changed only slightly, the change in temperature is given by the derivative 
(87/8V),. On converting this derivative to independent variables V, T, we 


obtain the formula 
or j oP 
ee ee Die ey uf 18.4 
(ar), c |? * (27), i 
The change in entropy is given by 
(OS/OV )z = P/T. (18.5) 


The entropy increases on expansion, as it should. 


§ 19. Maximum work 


Let us consider a thermally isolated system consisting of several bodies not 
in thermal equilibrium with one another. While equilibrium is being estab- 
lished, the system may do work on some external objects. The transition to 
equilibrium may, however, occur in different ways, and the final equilibrium 
states of the system will also be different; in particular, its energy and entropy 
will be different. 
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Accordingly, the total work which can be got from a non-equilibrium 
system will depend on the manner in which equilibrium is established, and we 
may ask how the equilibrium state must be reached in order that the system 
should do the maximum possible amount of work. Here we are concerned 
with the work done because the system is not in equilibrium; that is, we must 
exclude any work done by a general expansion of the system, since this work 
could also be done by a system in equilibrium. We shall therefore assume 
that the total volume of the system is unchanged by the process (although 
it may vary during the process). 

Let the original energy of the system be Ep, and the energy in the equi- 
librium state, as a function of the entropy of the system in that state, be 
E(S). Since the system is thermally isolated, the work which it does is just 
the change in energy: 


|R| = Eo—E(S); 


we write |R], since R < 0 in accordance with convention if work is done 
by the system. 
Differentiating | R| with respect to the entropy S of the final state, we have 


0|R[/oS = —(E/OS)y = —T, 


where T is the temperature of the final state; the derivative is taken with the 
volume of the system in its final state constant (the same as in the initial 
state). We see that this derivative is negative, i.e.| R| decreases with increas- 
ing S. The entropy of a thermally isolated system cannot decrease, and the 
greatest possible value of | R| therefore occurs if S remains constant through- 
out the process. 

Thus we conclude that the system does maximum work when its entropy 
remains constant, i.e. when the process of reaching equilibrium is revers- 
ible. 

Let us determine the maximum work which can be done when a small 
quantity of energy is transferred between two bodies at different tempera- 
tures T; and To, with Tz > 7}. First of all, it must be emphasised that, if the 
energy transfer occurred directly between the bodies on contact, no work 
would be done. The process would be irreversible, the entropy of the two 
bodies increasing by dE (1/T1— 1/T2), where 6F is the amount of energy trans- 
ferred. 

Consequently, in order to achieve a reversible transfer of energy and so 
maximise the work, some further body (the working medium) must be brought 
into the system and caused to execute a reversible cyclic process. This pro- 
cess must be carried out in such a way that the bodies between which direct 
transfer of energy occurs are at the same temperature. The working medium 
at temperature Ts is brought into contact with the body at that temperature 
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and receives a certain amount of energy from it isothermally. It is then adia- 
batically cooled to T;, releases energy at this temperature to the body at 7, 
and finally is adiabatically returned to its original state. In the expansions 
involved in this process the working medium does work on external objects. 
The cyclic process just described is called a Carnot cycle. 

‘To calculate the resulting maximum work, we first note that the working 
medium may be ignored, since it is returned to its initial state at the end of 
the process. Let the hotter body 2 lose an amount of energy —dE2 = —T2 dS 
and body 1 gain energy 6£1 = 71 6S1. Since the process is reversible, the 
sum of the entropies of the two bodies remains constant, i.e. 6S, = — 6S. 
The work done is equal to the decrease in the total energy of the two bodies, 
i.e. 

|OR| max = —OdF,—6EF. = —T; 0S1:—T2 dS 
= —(T2—T)) 682, 
or 


— T2-T 


|OR|max = T; | 6E2!. (19.1) 


The ratio of the work done to the amount of energy expended is called 
the efficiency n. The maximum efficiency when energy is transferred from a 
hotter to a cooler body is, from (19.1), 


Ymax = (T2:—T1)/T2. (19.2) 


A more convenient quantity is the utilisation coefficient n, defined as the ratio 
of the work done to the maximum work which can be obtained in given 


conditions. Clearly 
n= 7/Nmax+ (19.3) 


§ 20. Maximum work done by a body in an external medium 


Let us now consider a different formulation of the maximum-work prob- 
lem. Let a body be in an external medium whose temperature 7» and pressure 
Po differ from the temperature 7 and pressure P of the body. The body can do 
work on some object, assumed thermally isolated both from the medium and 
from the body. The medium, together with the body in it and the object on 
which work is done, forms a closed system. The volume and energy of the 
medium are so large that the change in these quantities due to processes in- 
volving the body does not lead to any appreciable change in the temperature 
and pressure of the medium, which may therefore be regarded as constant. 
_. If the medium were absent, the work done by the body on the thermally 
isolated object, for a given change in state of the body (i.e. for given initial 
and final states) would be completely defined, and equal to the change in the 
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energy of the body. The presence of the medium which also takes part in the 
process makes the result indefinite, and the question arises of the maximum 
work which the body can do for a given change in its state. 

If a body does work on an external object in a transition from one state 
to another, then in the reverse transition from the second state to the first 
some external source of work must do work on the body. A transition in 
which the body does the maximum work | R|,,,, corresponds to a reverse 
transition which requires the external source to do the minimum work R,,,- 
These must obviously be the same, so that the calculation of the one is equiv- 
alent to that of the other, and we shall speak below of the work done on 
the body by a thermally isolated external source of work. 

During the process, the body may exchange heat and work with the me- 
dium. The work done on the body by the medium must of course be sub- 
tracted from the total work done on the body, since we are concerned only 
with the work done by the external source. Thus the total change AE in the 
energy of the body in some (not necessarily small) change in its state con- 
sists of three parts: the work R done on the body by the external source, 
the work done by the medium, and the heat gained from the medium. As 
already mentioned, owing to the large size of the medium its temperature 
and pressure may be taken as constant, and the work done by it on the body 
is therefore Po AV, while the heat given up by it is ~T> AS (the suffix zero 
indicates quantities pertaining to the medium, while those for the body 
have no suffix). Thus 

AE = R+PpA4Vo—-To ASo. 


Since the total volume of the medium and the body remains constant, AV = 
—AYV, and the law of increase of entropy shows that 4S+4Sp) = 0; the 
entropy of the thermally isolated source of work does not vary. Thus AS > 
—AS. From R = AE—P»4V0+T0ASo we therefore find 


R>4AE-T)AS +Py AV. (20.1) 


The equality occurs for a reversible process. Thus we again conclude that 
the change occurs with minimum expenditure of work, and the reverse change 
with maximum work, if it occurs reversibly. The value of the minimum 
work is 

Ruin = A(E—ToS + PoV) (20.2) 


(T> and Po, being constants, can be placed after 4), i.e. this work is equal to 
the change in the quantity E—-T>S +P)V. For maximum work the formula 
must be written with the opposite sign: 


| Rlmax = —A(E—-T pS + PV), (20.3) 


since the initial and final states are interchanged. 
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If the body is in an equilibrium state at every instant during the process 
(but not, of course, in equilibrium with the medium), then for an infinitesi- 
mal change in its state formula (20.2) may be written differently. Substitut- 
ing dE = T dS—P dV in dR,,;, = dE—To dS + Po dV, we find 


dRmin = (T-To) dS—(P—P, o) dV. (20.4) 


Two important particular cases may be noted. If the volume and tempera- 
ture of the body remain constant, the latter being equal to the temperature 
of the medium, (20.2) gives R,,;, = J(E—TS), or 


Rein = AF 9 (20.5) 


i.e. the minimum work is equal to the change in the free energy of the body. 
Secondly, if the temperature and pressure of the body are constant and equal 


to Ty and Po, we have 
Rmin = AP, (20.6) 


i.e, the work done by the external source is equal to the change in the ther- 
modynamic potential of the body. 

It should be emphasised that in both these particular cases the body con- 
cerned must be one not in equilibrium, so that its state is not defined by T and 
V (or P) alone; otherwise, the constancy of these quantities would mean that 
no process could occur at all. We must consider, for example, a chemical 
reaction in a mixture of reacting substances, a process of dissolution, or the 
like. 

Let us now assume that a body in an external medium is left to itself and 
no work is done on it. Spontaneous irreversible processes will occur in the 
body and bring it into equilibrium. In the inequality (20.1) we must now put 
R = 0, and so 

A(E—T)S + PoV) « 0. (20.7) 


This means that the processes occurring in the body will cause the quantity 
E—T oS +PoV to decrease, and it will reach a minimum at equilibrium. 

In particular, for spontaneous processes at constant temperature T = T> 
and constant pressure P = Po, the thermodynamic potential ® of the body 
decreases, and for processes at constant temperature 7 = 79 and constant 
volume of the body its free energy F decreases. These results have already 
been derived by a different approach in § 15. It may be noted that the deri- 
vation given here does not essentially assume that the temperature and vol- 
ume (or pressure) of the body remain constant throughout the process: we 
may say that the thermodynamic potential (or free energy) of a body de- 
creases as a result of any process for which the initial and final temperature 
and pressure (or volume) are the same (and equal to the temperature and 
pressure of the medium), even if they vary during the process. 
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Another thermodynamic significance may also be ascribed to the minimum 
work. Let S, be the total entropy of the body and the medium. If the body 
is in equilibrium with the medium, S, is a function of their total energy E,: 


S; — S CE,). 
If the body is not in equilibrium with the medium, their total entropy differs 


from S,(E,) for the same value of the total energy E, by some amount 4S, 
< 0. In Fig. 3 the continuous line shows the function S,(Z,) and the vertical 


Er 


Fic. 3 


segment ab is —AS,. The horizontal segment bc is the change in the total 
energy when the body goes reversibly from the state of equilibrium with the 
medium to the state corresponding to the point b. In other words, this seg- 
ment represents the minimum work which must be done by some external 
source to bring the body from the state of equilibrium with the medium to 
the state considered; the equilibrium state in question (the point c in Fig. 
3) is, of course, not the same as that corresponding to the given value of E, 
(point a). 

Since the body is a very small part of the whole system, the processes 
involving it cause only a negligible relative change in the total energy and 
entropy. Figure 3 therefore shows that 


dS,(E;) 
AS, se Rae 
t dE, i 


But the derivative d£,/dS, is the equilibrium temperature of the system, i.e. 
the temperature To of the medium. Thus 


As, = —Bmin — 7 (4E-Te AS +Pp AV). (20.8) 
0 


This formula determines the amount by which the entropy of a closed system 
(body +medium) differs from its greatest possible value if the body is not in 
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equilibrium with the medium; 4E, AS and AV are here the differences be- 
tween the energy, entropy and volume of the body and their values in a 
state of complete equilibrium. 


§ 21. Thermodynamic inequalities 


In deriving the conditions of thermal equilibrium from that of maximum 
entropy, we have so far considered only the first derivatives. By equating to 
zero the derivatives with respect to energy and volume, we have deduced in 
§§ 9 and 12 the equality of temperature and pressure in all parts of the body as 
the conditions of equilibrium. But the vanishing of the first derivatives is 
only a necessary condition for an extremum and does not ensure that the 
entropy is in fact a maximum. The determination of the sufficient conditions 
for a maximum involves, of course, an examination of the second derivative 
of the function. 

Such an examination is, however, more conveniently carried out not from 
the condition of maximum entropy of a closed system but from another 
equivalent condition. Let us consider some small but macroscopic part of 
the body concerned. With respect to this part, the remainder of the body may 
be regarded as an external medium. Then, as shown in § 20, we can state that 
in equilibrium the quantity 


E-—T oS +PoV 


has a minimum, where E, S and V are the energy, entropy and volume of 
the part considered, and To, Po the temperature and pressure of the medium, 
i.e. of the remainder of the body. Clearly T> and Po are also the temperature 
and pressure of the part considered when in equilibrium. 

Thus in any small deviation from equilibrium the change in the quantity 
E-—T,S+ PV must be positive, i.e. 


6E—T 6S +Po 6V > 0. (21.1) 


In other words, the minimum work which must be done to bring this part of 
the body from equilibrium to any neighbouring state is positive. 

In what follows the equilibrium values will be implied for any coefficients 
appearing in the deviations of thermodynamic quantities from their equi- 
librium values, and the zero suffixes will therefore be omitted. 


t As regards the dependence of the entropy on the momenta of macroscopic 
motion, we have already investigated the conditions to be imposed on both the 
first and the second derivatives (§ 10), obtaining in this way the conditions that 
internal macroscopic motions in the.body be absent and that the temperature be 
positive. 


6 
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Expanding 6E as a series (regarding E as a function of S and V), we have 
as far as the second-order terms 


1 


og = SE as 428 5 V+ 


2 apt 2 
EE (6S)?-+2—— <<? ssav+2 7 Ov} 
But 0£/0S = T, 0E/0V = —P, so that the first-order terms are T 6S—P 6V, 
and cancel when 6£ is substituted in (21.1). Thus we obtain the condition 


OE 


oo (65 +2 seer asap oS vor = Ov) >0. (21.2) 


If such an inequality holds for arbitrary 6S and 6V, two conditions must 
be satisfied :* 


B7E/OS? > 0, (21.3) 
GE VE / BE \? 
Os ay - (agar) = a 


Since 
OE/dS2 = (8T/dS)y = T/C,, 


the condition (21.3) becomes 7/C,, > 0, or 
C, > 0, (21.5) 


i.e. the specific heat at constant volume is always positive. 
The condition (21.4) may be written in terms of the Jacobian 


O[(OE/OS)v, (BE/OV)s] 
ee a —a(T, P)/a(S, V) > 0. 


Changing to the variables T and V, we have 


aT, P) _ a, P/OT,V) _ (P/V)r __ T (aP\ _ 4 
a5,V) AS, Vat, Vv) (S/T Cy (ar). 


Since C,, > 0, this is equivalent to the condition 
(oP/eV )r < 0, (21.6) 


ie. an increase in volume at constant temperature is always accompanied by 
a decrease in pressure. 

The conditions (21.5) and (21.6) are called thermodynamic inequalities. 
States in which these conditions are not satisfied are unstable and cannot 
exist in Nature. 


t The special case where the equality sign holds in (21.4) will be discussed in 
§ 152. 
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It has already been noted in§ 16 that from the inequality (21.6) and formu- 
la (16.10) we always have C, > C,. From (21.5) we can therefore conclude 
that 

Cp > 0 (21.7) 
always. 

The fact that C, and C, are positive means that the energy isa monoton- 
ically increasing function of temperature at constant volume, and the heat 
function behaves similarly at constant pressure. The entropy increases mono- 
tonically with temperature at either constant volume or constant pressure. 

The conditions (21.5), (21.6), which have been derived for an arbitrary 
small part of a body, are of course valid for the whole body also, since in 
equilibrium the temperatures and pressures of all parts of the body are the 
same. Here it is assumed that the body is homogeneous (only such bodies 
have been considered so far). It must be emphasised that the fulfilment of 
the conditions (21.5), (21.6) depends on the homogeneity of the body. We 
can, for example, consider a body whose particles are held together by gravi- 
tational forces. Such a body will clearly be inhomogeneous, having a higher 
density towards the centre, and the specific heat of the body as a whole may 
be less than zero, so that its temperature rises as its energy decreases. We 
may note that this does not contradict the result that the specific heat is 
positive for every small part of the body, since in these conditions the energy 
of the whole body is not equal to the sum of the energies of its parts; there 
is also the energy of the gravitational interaction between these parts. 

The inequalities derived above are conditions of equilibrium, but their 
fulfilment is not sufficient for the equilibrium to be completely stable. There 
can exist states such that the entropy decreases for an infinitesimal deviation 
from the state and the body then returns to its initial state, whereas for a finite 
deviation the entropy may be greater than in the original state. After such 
a finite deviation the body does not return to its original state, but will tend 
to pass to some other equilibrium state corresponding to a maximum entropy 
greater than that in the original state. Accordingly, we must distinguish 
between metastable and stable equilibrium states. A body in a metastable 
state may not return to it after a sufficient deviation. Although a metastable 
state is stable within certain limits, the body will always leave it sooner or 
later for another state which is stable, corresponding to the greatest of the 
possible maxima of entropy. A body which is displaced from this state will 
always eventually return to it. 


§ 22. Le Chatelier’s principle 


Let us consider a closed system consisting of a body and a medium 
surrounding it. Let S be the total entropy of the system, and y a quantity 


6* 
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pertaining to the body, such that the condition for S to be a maximum rela- 
tive to y, i.e. OS/dy = 0, signifies that the body itself isin equilibrium, though 
it is not necessarily in equilibrium with the medium. Also, let x be another 
thermodynamic quantity pertaining to the same body, such that if both 
0S/dy = 0 and dS/dx = 0 the body is not only in internal equilibrium but 
also in equilibrium with the medium. 

We shall use the notation 


X=—@S/ox, Y =—asS/dy. (22.1) 


In complete thermodynamic equilibrium the entropy S must be a maximum. 
For this, besides the conditions 


the conditions ans Une (22.2) 
(8X/8x)y > 0, (OY /ay)x > 0 (22.3) 
and | 
(&), (),- [(e),] > ° a 


must be satisfied. 

Let us now assume that the equilibrium of the body with the medium is 
destroyed by some small external interaction, the quantity x being somewhat 
changed and the condition ¥ = 0 no longer satisfied; we assume that » is 
not directly affected by the interaction in question. Let the change in x be 
Ax. Then the change in X at the instant of interaction is 


(AX)y = (6X/0x), Ax. 


The change in x at constant y leads, of course, to a violation of the condi- 
tion Y = 0 also, i.e. of internal equilibrium of the body. When equilibrium 
is again restored, the quantity X = AX will be 


(AX) ymo0 = (8X/8X)r=0 4x, 


where the derivative is taken at constant Y(= 0). 
To compare the two values of 4X, using the properties of Jacobians, we 
have 


(=) _ X,Y) _ AX, Y/axy) _ (=) ee [OX /ay),? 
Y=0 . y 


Ox a Vax (OY /dy)x 


The denominator of the second term in this expression is positive by the 
condition (22.3); using also (22.4), we find that 
(8X/0x)y > (OX/Ox)ymo > 0, (22.5) 


or 
[(AX)y| > (AX) veo]. (22.6) 
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The inequality (22.5) or (22. 6) forms the content of what is called Le 
Chatelier’s principle. ' 

We shall regard the change 4x of the quantity x as a measure of the ex- 
ternal interaction acting on the body, and AX as a measure of the change in 
properties of the body resulting from this interaction. The inequality (22.6) 
shows that, when the internal equilibrium of the body is restored after the 
external interaction which disturbed it, the value of 4X is reduced. Thus 
Le Chatelier’s principle may be formulated as follows: an external interac- 
tion which disturbs the equilibrium brings about processes in the body which 
tend to reduce the effects of this interaction. 

The above may be illustrated by some examples. 

First of all, it is convenient to modify somewhat the definition of the 
quantities X¥ and Y by using formula (20.8), according to which the change in 
entropy of the system (medium +body) is — R,,;,/To, where Ty is the temper- 
ature of the medium and R,,;, the minimum work needed to bring the body 
from a state of equilibrium with the medium to the state in question. We can 
therefore write 


Ys. 2. (22.7) 


For an infinitesimal change in the state of the body we have (see (20.4)) 


here and below all quantities without suffix relate to the body, and those 
with suffix 0 to the medium. 

Let x be the entropy S of the body. Then X¥ = (T—To)/To. The equilibrium 
condition X = 0 gives T = To, i.e. the temperatures of the body and the 
medium are equal. The inequalities (22.5) and (22.6) become 


(8T/OS)y > (OT/BS)ymo > 0, (22.8) 
(AT), | > [(AT)y=ol- (22.9) 


The significance of these inequalities is as follows. The change in x (the 
entropy of the body) means that a quantity of heat is given to or taken from 
the body. This destroys the equilibrium of the body itself and, in particular, 
changes its temperature by (AT),. The restoration of equilibrium in the 
body has the result that the absolute value of the change in temperature 
decreases, becoming (A7)y.9, i.e. it is as if the result of the interaction 
which brings the body out of equilibrium were reduced. We can say that 
heating or cooling a body brings about processes in it which tend to lower 
or raise the temperature respectively. 
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Now let x be the volume V of a body. Then X¥ = —(P—P)/To. In equi- 
librium X = 0, ie. P = Po. The inequalities (22.5) and (22.6) give 


(OP/aV)y < (OP/OV)y—0 < 0, (22,10) 
\(AP)y! > |(4P)y=ol. (22.11) 


If the body is disturbed from equilibrium by a change in its volume at 
constant temperature, then, in particular, its pressure is changed; the res- 
toration of equilibrium in the body leads to a decrease in the absolute value 
of the change in pressure. Since a decrease in the volume of the body causes 
an increase in its pressure, and vice versa, we can say that decreasing or 
increasing the volume of a body brings about processes in it which tend to 
lower or raise the pressure respectively. 

Later we shall meet with numerous applications of these results (to solu- 
tions, chemical reactions and so on). 

It may also be noted that, if y in the inequalities (22.8) is taken to be the 
volume of the body, we have 


(OT/8S), = (OT/OS)y = T/Cy, 
(OT/8S)y-0 = (OT /OS)p = T/Cp, 


since the condition Y = 0 then denotes P = Po, i.e. constant pressure. 
Thus we again obtain the already familiar inequalities C, > C,, > 0. Simi- 
larly, if in (22.10) y is taken as the entropy of a body, the condition Y = 0 
implies that the temperature is Constant, T = To, and we find 


(OP/OV)s < (P/V )r < 0, 


another result already known. 


§ 23. Nernst’s theorem 


The fact that the specific heat C,, is positive means that the energy is a 
monotonically increasing function of the temperature. Conversely, when 
the temperature falls the energy decreases monotonically, and therefore, 
when the temperature has its least possible value, i.e. at absolute zero, a body 
must be in the state of least possible energy. If we regard the energy of a 
body as the sum of the energies of the parts into which it may be imagined to 
be divided, we can say that each of these parts will also be in the state of 
least energy; it is clear that the minimum value of the sum must correspond 
to the minimum value of each term. 

Thus at absolute zero any part of the body must be in a particular quan- 
tum state, the ground state. In other words, the tsatistical weights of these 
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parts are equal to unity, and therefore so is their product, i.e. the statistical 
weight of the macroscopic state of the body as a whole. The entropy of the 
body, being the logarithm of its statistical weight, is therefore zero. 

We consequently reach the important result that the entropy of any body 
vanishes at the absolute zero of temperature. This is called Nernst’s theorem 
(W. Nernst, 1906). 

It should be emphasised that this theorem is a deduction from quantum 
statistics, in which the concept of discrete quantum states is of essential 
importance. The theorem cannot be proved in purely classical statistics, 
where the entropy is determined only to within an arbitrary additive con- 
stant (see § 7). 

Nernst’s theorem enables us to draw conclusions also concerning the 
behaviour of certain other thermodynamic quantities as T — 0. 

For instance, it is easy to see that for T = 0 the specific heats C, and C, 
both vanish: 


C,=C,=0 for T=0. (23.1) 
This follows immediately from the definition of the specific heat in the form 


C = T0dS/oT 
= 08/0 log T. 


When T —~ 0, log J -- — -, and since S tends to a finite limit, namely zero, 
it is clear that the derivative tends to zero. 
The thermal expansion coefficient also tends to zero: 


(OV/oT)p=0 for T=0. (23.2) 


For this derivative is equal to the derivative —(0S/8P),z (see (16.4)), which 
vanishes for T = 0, since S = 0 for T = 0 and any pressure. 
Similarly, we can see that 


(OP/eT)y=0 for T=0. (23.3) 


The entropy usually vanishes, for 7 - 0, according to a power law, Le. 
as S = aT”, where a is a function of pressure or volume. In this case, clearly, 
the specific heats and (6V/6T)p, (OP/OT), will tend to zero in the same way 
(with the same value of n). 


* To avoid misunderstandings we should emphasise that this refers to the tem- 
perature tending to zero with other conditions remaining unchanged—say at con- 
stant volume, or at constant pressure. If, on the other hand, the temperature of a gas 
tends to zero while its density decreases without limit, for example, the entropy 
need not tend to zero. 
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Finally, it may be seen that the difference C,—C, tends to zeromore 
rapidly than the specific heats themselves, i.e. 


(C,—-C,)/Cp=0 for T=0. (23.4) 


For let the entropy tend to zero as S oc J” for T -> 0. From formula (16.9) 
we then see thatC,—C, oc T*"**, so that (C,—C,)/C, oc T"*"; it should be 
borne in mind that the compressibility (OV/OP); is in general finite and not 
zero when T = 0. 

If the specific heat of a body is known for all temperatures, the entropy 
can be calculated by integration, and Nernst’s theorem gives the value 
of the constant of integration. For example, the dependence of the entropy 
on témperature for a given pressure is determined by 


T 
S= [(C,/T) dr. (23.5) 
0 
The corresponding formula for the heat function is 
T 
W = Wo+fC, 47, (23.6) 
0 


where W, is the value of the heat function for T = 0. Similarly, for the 
thermodynamic potential ® = W—TS we have 


T 
@= Wot ie Oo Niegat (23.7) 
8 


§ 24. The dependence of the thermodynamic quantities on the number of 
particles 


As well as the energy and entropy, such thermodynamic quantities as F, ® 
and W also have the property of additivity, as follows directly from their 
definitions if we bear in mind that the pressure and temperature are constant 
throughout a body in equilibrium. From this property we can draw certain 
conclusions concerning the manner in which each of these quantities depends 
on the number of particles in the body. Here we shall consider bodies con- 
sisting of identical particles (molecules); all the results can be immediately 
generalised to mixtures of different particles (see § 85). 

The additivity of a quantity signifies that, when the amount of matter 
(and therefore the number N of particles) is changed by a-given factor, the 
quantity is changed by the same factor. In other words, we can say that an 
additive thermodynamic quantity must be a homogeneous function of the 
first order with respect to the additive variables. 
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Let us express the energy of the body as a function of the entropy, vol- 
ume, and number of particles. Since S and V are themselves additive, this 
function must be of the form 


E = Nf(SIN, VIN), (24.1) 


the most general homogeneous function of the first order in N, S and V. 

The free energy F is a function of N, T and V. Since the temperature is 
constant throughout the body, and the volume is additive, a similar argu- 
ment gives 


F = Nf(V/N, T). (24.2) 
In exactly the same way we have for the heat function W, expressed as a 
function of N, S and the pressure P, 
W = Nf(S/N, P). (24.3) 
Finally, the thermodynamic potential as a function of N, P and T is 
PD = Nf(P,T). (24.4) 


In the foregoing discussion we have essentially regarded the number of 
particles as a parameter which has a given constant value for each body. 
We shall now formally consider N as a further independent variable. Then 
the expressions for the differentials of the thermodynamic potentials must 
include terms proportional to dN. For example, the total differential of the 
energy will be written 


dE = TdS—P dV +u dN, (24.5) 
where u denotes the partial derivative 


The quantity yu is called the chemical potential of the body. Similarly we 
have 


dW = TdS+V dP+u dN, (24.7) 
dF =-—SdT-—PdV+ydN, (24.8) 
d® =—SdT+V dP+udN, (24.9) 


with the same u. These formulae show that 
6 = (OW/ON)s, p = (OF /ON)r,v = (OP/ON)p, r, (24.10) 


i.e. the chemical potential can be obtained by differentiating any of the 
quantities E, W, F and ® with respect to the number of particles, but the 
result is expressed in terms of different variables in each case. 
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Differentiating ® in the form (24.4), we find that u = 0®/ON = f(P, T), 
i.e. 
@ = Nu. (24.11) 
Thus the chemical potential of a body (consisting of identical particles) is 
just its thermodynamic potential per molecule. When expressed as a function 
of P and T, the chemical potential is independent of N. Thus we can imme- 
diately write down for the differential of the chemical potential 


du =—sdT+ dP, (24.12) 


where s and v are the entropy and volume per molecule. 

If we consider (as we have usually done hitherto) a definite amount of 
matter, the number of particles in it is a given constant, while the volume 
is variable. Let us now take a certain volume within the body, and consider 
the matter enclosed therein; the number of particles N will now be variable, 
and the volume V constant. Then, for example, equation (24.8) reduces to 


dF =—SdT+udn. 


Here the independent variables are T and N. We may define a thermodynamic 
potential such that the second independent variable is u, not N. To do so, 
we substitute u dN = d(uN)—N du, obtaining 


d(F—uN) =—S dT—N du. 


But uN = ©, and F—® =—PV. Thus the new thermodynamic potential 
(denoted by 2) is just 
Q=-PV, (24.13) 
and 
dQ =—S dT—N du. (24.14) 


The number of particles is obtained by differentiating 2 with respect to the 
chemical potential at constant temperature and volume: 


N =—(02/du)r,v = V(OP/Oy)r, r- (24.15) 


In the same way as we proved the equality of small changes in E, W, F 
and ® (with the appropriate pairs of quantities constant), we can easily show 
that the change (62), ,, , at constant T, u, V has the same property: 


(6E)s, v,n = (OF )r,v,n = (6®)r,p,n = (6W)s,p.n = (62)7,7,n- (24.16) 


These equations refine and extend the theorem of small increments (15.12). 

Finally, as in §§ 15 and 20 for the free energy and the thermodynamic 
potential, we may show that the work in a reversible process occurring at 
constant T, V and wu is equal to the change in the potential 2. In a state of 
thermal equilibrium the potential 2 is a minimum with respect to any change 
of state at constant T, V, mu. 
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PROBLEM 
Derive an expression for the specific heat C, in terms of the variables T, u, V. 


SOLUTION. We transform the derivative C, = T(0S/0T)y,n to the variables T, 
V, us, writing (with V regarded as a constant throughout) 
os OCS, NY) O(s, N)/O(T, a2 chy 2 (0S/0)7 (ON /OT), 
(ar). ~ OTN) Or, N/A. uw) = (57), ON/Op)r 


But (0S/Ou)7 = —07Q/OT Op = (ON/OT),, and therefore 


8S\ _{@N/OT),F 
a "\(5z),7 GN f 


§ 25. Equilibrium of a body in an external field 


Let us consider a body in an external field which is constant in time. The 
different parts of the body are in different conditions, and the body will 
therefore be inhomogeneous. One of the conditions of equilibrium of such a 
body is again that the temperature should be constant throughout it, but the 
pressure will now vary from point to point. 

To derive the second condition of equilibrium, let us consider two ad- 
joining volumes in the body and maximise their entropy S = S1+S2 when 
the remainder of the body is in a fixed state. One necessary condition for a 
maximum is that the derivative GS/0N1 should be zero. Since the total 
number of particles N1+ Ne in these two parts of the body is regarded as 
constant, we have 


shows that the derivative 0S/ON for constant E and Tis — u/T. Thus 41/7; = 
42/T2. But in equilibrium 7, = T2, so that 1 = fe. We therefore conclude 
that in equilibrium in an external field, in addition to the constancy of 
temperature, we must have 

f4 = constant, (25.1) 


i.e. the chemical potential of every part of the body must be the same. The 
chemical potential of each part is a function of its temperature and pressure, 
as well as of the parameters which define the external field. If there is no field, 
the constancy of u and T necessarily implies that of the pressure. 
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In a gravitational field the potential energy u of a molecule is a function 
only of the coordinates x, y, z of its centre of gravity (and not of the ar- 
rangement of the atoms within the molecule). In this case the change in the 
thermodynamic quantities for the body amounts to adding to its energy the 
potential energy of the molecules in the field. In particular, the chemical 
potential (the thermodynamic potential per molecule) has the form u = fo+ 
u(x, y, Zz), where fo(P, 7) is the chemical potential in the absence of the 
field. Thus the condition of equilibrium in a gravitational field may be 


written 
fo P, T) +u(x, y, Z) = constant. (25.2) 


In particular, in a uniform gravitational field u = mgz (where m is the mass 
of a molecule, g the acceleration due to gravity, and z the vertical coordinate). 
Differentiating equation (25.2) with respect to the coordinate z at constant 
temperature, we have v dP = —mg dz, where v = (Ou,/OP)z is the specific 
volume. For small changes in pressure, v may be regarded as constant. 
Substituting the density @ = m/v and integrating, we obtain 


P = constant— ogz, 


the customary formula for the hydrostatic pressure in an incompressible 
fluid. 


§ 26. Rotating bodies 


In a state of thermal equilibrium, as we have seen in § 10, only a uniform 
translational motion and a uniform rotation of a body as a whole are pos- 
sible. The uniform translational motion needs no special treatment, since by 
Galileo’s relativity principle it has no effect on the mechanical properties 
of the body, nor therefore on its thermodynamic properties, and the thermo- 
dynamic quantities are unchanged except that the energy of the body is 
increased by its kinetic energy. 

Let us consider a body in uniform rotation round a fixed axis with angular 
velocity 2. Let E(p, g) be the energy of the body in a fixed coordinate system 
and E’(p, qg) the energy in a coordinate system rotating with the body. We 
know from mechanics that these quantities are related by 


E"(p, 9) = E(p, q¢)-2-M(p, 9), (26.1) 
where M(p, q) is the angular momentum of the body.' 


t See Mechanics, § 39. Although the derivation of formula (39.13) is based on 
classical mechanics, in quantum theory exactly the same relations apply to the 
operators of the corresponding quantities. Hence all the thermodynamic relations 
derived below are independent of which mechanics describes the motion of the 
particles in the body. 
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Thus the energy E’(p, g) depends on the angular velocity 2 as a parameter, 
and 
OF (p, g)/02 = —-M(y, q). 
Averaging this equation over the statistical distribution and using formula 
(11.3), we obtain 
(OE’/02)s = —M, (26.2) 
where E” = E’(p,q), M = M(p, 9) are the mean (thermodynamic) energy 


and angular momentum of the body. 
From this relation we can write down the differential of the energy of a 


rotating body of given volume: 
dE’ = TdS—M-d2Q. (26.3) 


Similarly, for the free energy F’ = E’—TS (in the rotating coordinate sys- 
tem) we have 
dF’ = —S dT—M-dQ. (26.4) 
Averaging equation (26.1) gives 
E’ = E-M.Q. (26.5) 
Differentiating this equation and substituting (26.3), we obtain the differen- 
tial of the energy in the fixed coordinate system: 


dE = TdS+2.dM. (26.6) 
Correspondingly, for the free energy F = E—TS 
dF = —SdT+2edM. (26.7) 


Thus in these relations the independent variable is not the angular velocity 
but the angular momentum, and 


Q = (BE/OM)s = (@F/8M)r. (26.8) 


As we know from mechanics, a uniform rotation is in a certain sense equiv- 
alent to the presence of two fields of force, centrifugal and Coriolis. The 
centrifugal forces are proportional to the size of the body, as they involve the 
distance from the axis of rotation; the Coriolis forces are independent of the 
size of the body. For this reason the effect of the Coriolis forces on the ther- 
modynamic properties of a rotating macroscopic body is entirely negligible 
in comparison with that of the centrifugal forces, and the former can usually 
be neglected. The condition of thermal equilibrium of a rotating body is 
therefore obtained by simply substituting for u(x, y, z)in (25.2) the centrifugal 
energy of the particles: 


oP, T)—4mQ*? = constant, (26.9) 


t It may be shown that in classical statistics the Coriolis forces do not affect 
the statistical properties of the body; see § 34. 
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where uo is the chemical potential of the body at rest, m the mass of a mole- 
cule, and r the distance from the axis of rotation. 

For the same reason, the total energy E of a rotating body may be written 
as the sum of its internal energy (here denoted by £,,) and its kinetic energy 
of rotation: 


E = Ejn-+M?/2I, (26.10) 


where J is the moment of inertia of the body with respect to the axis of rota- 
tion. It should be remembered that rotation in general changes the distribu- 
tion of mass in the body, and so the moment of inertia and internal energy 
of the body are themselves in general functions of 2 (or of M4). They may be 
regarded as constants independent of 2 only when the rotation is sufficiently 
slow. 

Let us consider an isolated uniformly rotating solid with a given mass 
distribution. Since the entropy of a body is a function of its internal energy, 
we have in this case S = S(E—M?/21). Because the body is a closed system, 
its total energy and angular momentum are conserved, and the entropy must 
have the maximum value possible for the given M and E. We therefore con- 
clude that the equilibrium rotation of the body takes place about the axis 
with respect to which the moment of inertia has the greatest possible value. 
This assumes that the axis of rotation is necessarily a principal axis of inertia 
of the body, but the latter result is evident: if the body rotates about an axis 
other than a principal axis of inertia, then, as we know from mechanics, the 
axis of rotation will itself precess in space, and the rotation will be non- 
uniform, and therefore not an equilibrium rotation. 


§ 27. Thermodynamic relations in the relativistic region 


Relativistic mechanics leads to a number of changes in the usual thermo- 
dynamic relations. Here we shall discuss the most interesting of these changes. 
If the microscopic motion of the particles forming a body becomes relativ- 
istic, the general thermodynamic relations are unchanged, but there is an 
important inequality between the pressure and energy of the body: 


P< E/3v, (27.1) 


where E is the energy of the body including the rest energy of the particles 
in it. 


t See Fields, § 35. It should be mentioned, however, that there is as yet no general 
proof of this inequality valid for all types of interaction (not only electromagnetic) 
which exist between particles in Nature. 
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The changes caused by the general theory of relativity in the conditions of 
thermal equilibrium, taking account of the gravitational field of the body 
itself, are of fundamental importance. Let us consider a macroscopic body 
at rest; its gravitational field is, of course, constant. In a constant gravi- 
tational field we must distinguish the conserved energy Ey of any small part 
of the body from the energy E measured by an observer situated at a given 
point. These two quantities are related by 


Eo = EV200; 


where Zoo is the time component of the metric tensor; see Fields, § 88, 
formula (88.9) with v = 0, mc? = E. But, from the sense of the proof given 
in § 9 that the temperature is constant throughout a body in equilibrium, it 
is clear that the quantity obtained by differentiating the entropy with re- 
spect to the conserved energy Ey must be constant. The temperature 7’ 
measured by an observer situated at a given point in space is, however, 
obtained by differentiating the entropy with respect to the energy E, and 
will therefore be different at different points in the body. 

To derive a quantitative relation, we note that the entropy, by definition, 
depends only on the internal state of the body and so is unchanged by the 
presence of a gravitational field (provided that this field does not affect the 
internal properties of the body, a condition which is always satisfied in prac- 
tice). The derivative with respect to entropy of the conserved energy Ep is 
therefore T -/goo, and so one of the conditions of thermal equilibrium is 
that 

T /200 = constant (27.2) 
throughout the body. 

A similar change occurs in the second condition of equilibrium, the con- 
stancy of the chemical potential. The latter is defined as the derivative of the 
energy with respect to the number of particles. Since this number is of course 
unaffected by a gravitational field, we have for the chemical potential meas- 
ured at any given point a relation of the same kind as for the temperature: 


Le+/Z00 = Constant. (27.3) 
We may note that the relations (27.2), (27.3) may be written 


T = constant X dx°/ds, 


4 = constant X dx°/ds, Gia 


which enable us to consider the body not only in the frame of reference in 
which it is at rest but also in those where it is moving (rotating as a whole). 
The derivative dx°/ds must be taken along the world line described by the 
point considered in the body. 
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In a weak (Newtonian) gravitational field, goo = 1+2¢/c?, where @ is 
the gravitational potential (see Fields, § 87). Substituting this expression in 
(27.2) and taking the square root, we have to the same approximation 


T = constant X (1— /c?). (27.5) 


Since ¢ < 0, this shows that in equilibrium the temperature is higher at 
points in the body where | | is greater, i.e. within the body. In the limit of 
non-relativistic mechanics (c¢ --), (27.5) becomes T = constant, as it 
should. 

We can similarly transform the condition (27.3), bearing in mind that the 
relativistic chemical potential, in the limit of classical mechanics, does not 
directly become the ordinary (non-relativistic) expression for the chemical 
potential in the absence of a field, which we now denote by so, but fo+me?, 
where mc? is the rest energy of a particle of the body. Thus we have 


/00 & (Ho-+me*) (1 + b/c?) 
x Uo +mc* +md, 
so that the condition (27.3) becomes 


fo-+md = constant; 


this agrees with (25.2), as it should. 

Finally, we may mention a useful relation which follows immediately from 
the conditions (27.2) and (27.3). Dividing one by the other, we find that 
u/T = constant, and hence 


du/u = d7/T. 
From (24.12), at constant volume (equal to unity) we have 
dP = SdT+N du, 


where S and N are the entropy and number of particles in unit volume of the 
body. Substituting dT = (T/) du and noting that uN+ST = © + ST = +P, 
where e is the energy per unit volume, gives‘ 


du/u = dP/(e+P). (27.6) 


t In the non-relativistic case, putting 4 =~ mc?, € = ec* >> P (where 0 is the 
density), we get du = v dP with v = m/o the volume per particle; this is as it 
should be for T = constant. 


CHAPTER III 


THE GIBBS DISTRIBUTION 


§ 28. The Gibbs distribution 


LT us now turn to the problem stated in Chapter I of finding the distribution 
function for a subsystem, i.e. any macroscopic body which is a small part of 
some large closed system. The most convenient and general method of 
approaching the solution of this problem is based on the application of the 
microcanonical distribution to the whole system. 

Distinguishing the body in question from the rest of the closed system, we 
may consider the system as consisting of these two parts. The rest of the 
system will be called the “medium” in relation to the body. 

The microcanonical distribution (6.6) can be written in the form 


dw = constantX 6(E+E’—B) dl dl”, (28.1) 


where E, dl and E’, dI” relate to the body and the medium respectively, and 
E© is the given value of the energy of the closed system, which must be equal 
to the sum E'+ E’ of the energies of the body and the medium. 

Our object is to find the probability w, of a state of the whole system 
such that the body concerned is in some definite quantum state (with energy 
E,), i.e. a microscopically defined state. The microscopic state of the medium 
is of no interest, so that we shall suppose this to be in a macroscopically 
defined state. Let AI” be the statistical weight of the macroscopic state of the 
medium and let AE’ be the range of values of the energy of the medium 
corresponding to the range AJ” of quantum states in the sense discussed in 
§ 7. 

The required probability w, can be found by taking dJ’ = 1 in (28.1), 
putting E = E, and integrating with respect to I”: 

W, = constantX j O(£, +E’ —E) dl’. 
Let I’(E’) be the total number of quantum states of the medium with energy 
not exceeding E’. Since the integrand depends only on E’, we can change to 
integration with respect to E’, putting dI” = (dI’(E)/dE’)dE’. The deriva- 
tive dI”/dE’ is replaced (cf.§ 7) by 
dI’/dE’ = eS AE’, 
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where S’(£’) is the entropy of the medium as a function of its energy; AE’ is, 
of course, also a function of E’. Thus we have 


- 


es 
AE’ 


Wa = constant { O(E’ +£,,—-E) dE’. 


Owing to the presence of the delta function, the result of the integration is 
simply to replace E’ by E®—E,: 


- 


es 
w, = constant X |-——, ° 
(az E’mE(0)—E, 


We now use the fact that, since the body is small, its energy £, is small in 
comparison with E®. The quantity JE’ undergoes only a very small relative 
change when E’ varies slightly, and so in AE’ we can simply put E’ = E®; 
it then becomes a constant independent of E,. In the exponential factor e*, 
we must expand S’(E —E,) in powers of E, as far as the linear term: 


S'(E—E,) = S'(E)—E, dS'(E)/dE®. 


(28.2) 


The derivative of the entropy S’ with respect to energy is just 1/T, where T 
is the temperature of the system; the temperatures of the body and the medi- 
um are the same, since the system is assumed to be in equilibrium. 

Thus we have finally for w,, the expression 


w, = Ae~EwT , (28.3) 


where A is a normalisation constant independent of E,. This is one of the 
most important formulae in statistical physics. It gives the statistical distribu- 
tion of any macroscopic body which is a comparatively small part of a large 
closed system. The distribution (28.3) is called the Gibbs distribution or canon- 
ical distribution; it was discovered by J. W. Gibbs for classical statistics in 
1901. 

The normalisation constant A is given by the condition Zw, = 1, whence 


1 
A 
th He 
The mean value of any physical quantity f pertaining to the body can be 
calculated by means of the Gibbs distribution, using the formula 


f = » Wifnn 
= V fie EWT » o-FulT, (28.5) 


= Sem, (28.4) 
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In classical statistics an expression exactly corresponding to (28.3) is 
obtained for the distribution function in phase space: 


o(p, gq) = Ae~F@. nT, (28.6) 


where E(p, q) is the energy of the body as a function of its coordinates and 
momenta.’ The normalisation constant A is given by the condition 


fodpdg = A fe#.0'T dp dg = 1. (28.7) 


In practice, cases are frequently encountered where it is not the entire 
microscopic motion of the particles which is quasi-classical, but only the 
motion corresponding to some of the degrees of freedom, whereas the motion 
with respect to the remaining degrees of freedom is quantised (for example, 
the translational motion of the molecules may be quasi-classical while the 
motion of the atoms within the molecules is quantised). Then the energy 
levels of the body may be written as functions of the quasi-classical co- 
ordinates and momenta: E, = E,(p, q), where n denotes the set of quantum 
numbers defining the “quantised part” of the motion, for which p and q are 
parameters. The Gibbs distribution formula then becomes 


dw,(p, q) = Aew*al? WT dog dda, (28.8) 


where dp,,dq,, is the product of differentials of the “quasi-classical” co- 
ordinates and momenta. 

Finally, the following comment is necessary concerning the group of prob- 
lems which may be solved by means of the Gibbs distribution. We have spo- 
ken of the latter throughout as the statistical distribution for a subsystem, as 
in fact it is. It is very important to note, however, that this same distribution 
can quite succesfully be used also to determine the fundamental statistical 
properties of bodies forming closed systems, since such properties of a body 
as the values of the thermodynamic quantities or the probability distributions 
for the coordinates and velocities of its individual particles are clearly inde- 
pendent of whether we regard the body as a closed system or as being placed 
in an imaginary thermostat (§ 7). But in the latter case the body becomes a 
“subsystem” and the Gibbs distribution is immediately applicable to it. The 
difference between bodies forming closed and non-closed systems when the 
Gibbs distribution is used appears essentially only in the treatment of the 
fairly unimportant problem of fluctuations in the total energy of the body. 
The Gibbs distribution gives for the mean fluctuation of this quantity a non- 


t To avoid misunderstanding, let us mention once more that the w, (or i) are 
monotonic functions of energy and need not have maxima for E= E. It is the 
distribution function with respect to energy, obtained by multiplying w,, by 
dI(£)/dE, wich has a sharp maximum at E = E£. 
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zero value, which is meaningful for a body in a medium but is entirely spu- 
tious for a closed system, since the energy of such a body is by definition 
constant and does not fluctuate. 

The possibility of applying the Gibbs distribution (in the manner described) 
to closed systems is also seen from the fact that this distribution hardly differs 
from the microcanonical distribution, while being very much more conven- 
ient for practical calculations. For the microcanonical distribution is, roughly 
speaking, equivalent to regarding as equally probable all microstates of the 
body which correspond to a given value of its energy. The canonical distribu- 
tion is “spread” over a certain range of energy values, but the width of this 
range (of the order of the mean fluctuation of energy) is negligible for a 
macroscopic body. 


§ 29. The Maxwellian distribution 


The energy E(p, g) in the Gibbs distribution formula of classical statistics 
can always be written as the sum of two parts: the kinetic energy and the 
potential energy. The first of these is a quadratic function of the momenta of 
the atoms,’ and the second is a function of their coordinates, the form of 
which depends on the law of interaction between the particles within the body 
(and on the external field, if any). If the kinetic and potential energies are 
denoted by K(p) and U(q) respectively, then E(p, g) = K(p)+ U(q), and the 
probability dw = oe(p, g) dp dg becomes 


dw = Ae-U@VTe-K@)T dp dq, 


ie. is the product of two factors, one of which depends only on the co- 
ordinates and the other only on the momenta. This means that the probabil- 
ities for momenta and coordinates are independent, in the se nse that any 
particular values of the momenta do not influence the probabilities of the 
various values of the coordinates, and vice versa. Thus the probability of the 
various values of the momenta can be written 


dw, = ae—*leyT dp, (29.1) 
and the probability distribution for the coordinates is 
dw, = be-Y@/T dg, (29.2) 


Since the sum of the probabilities of all possible values of the momenta 
must be unity (and the same applies to the coordinates), each of the probabil- 
ities dw, and dw, must be normalised, i.e. their integrals over all possible 


t It is assumed that Cartesian coordinates are used. 
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values of the momenta and coordinates respectively for the body concerned. 
must be equal to unity. From these conditions we can determine the constants 
a and b in (29.1) and (29.2). . 

Let us consider the probability distribution for the momenta, and once 
again emphasise the very important fact that in classical statistics this distri- 
bution does not depend on the nature of the interaction of particles within 
the system or on the nature of the external field, and so can be expressed in 
a form applicable to all bodies.* 

The kinetic energy of the whole body is equal to the sum of the kinetic 
energies of each of the atoms composing it, and the probability again falls 
into a product of factors, each depending on the momenta of only one atom. 
This again shows that the momentum probabilities of different atoms are 
independent, i.e. the momentum of one does not affect the probabilities of 
various momenta of any other. We can therefore write the probability distri- 
bution for the momenta of each atom separately. 

For an atom of mass m the kinetic energy is (p+ p',+p;)/2m, where p,, 
Py, Pp, are the Cartesian components of its momentum, and the probability 
distribution is 

dw, = ae (Path, +P)/2"T dn. dp, dp, - 


The constant a is given by the normalisation condition. The integrations 
over p,, p, and p, are separate; by means of the formula 


i e-** dx = 4/(2/a) 


we find a = (2mmT)~*”, and the momentum probability distribution takes 
the final form 


1 


= amr? en (Prt Py+P;)/2mT dp, dp, dp. ; (29.3) 


dw, 


Changing from momenta to velocities (p = mv), we can write the corre- 
sponding velocity distribution as 


3/2 i 
m m(v2-+yt+y 
dw, = (5) e~ ( et yt 3)/2T dv, dv, dv, ° (29.4) 


This is the Maxwellian distribution (J.C. Maxwell, 1860). It again consists 
of a product of three independent factors 


— \V ™ o-mipr 
div, = Ve : do,, ..., (29.5) 


t In quantum statistics this statement is not true in general. 
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each of which gives the probability distribution for a single velocity compo- 
nent. 

If the body consists of molecules (e.g. a polyatomic gas), then together with 
the Maxwellian distribution for the individual atoms there is a similar distri- 
bution for the translational motion of each molecule as a whole: from the 
kinetic energy of the molecule we can separate a term which gives the energy 
of the translational motion, and so the required distribution separates in the 
form (29.4), where m must now be taken as the total mass of the molecule, 
and v,, v,, v, as the velocity components of its centre of mass. It should be 
emphasised that the Maxwellian distribution for the translational motion of 
molecules can be valid quite independently of the nature of the motion of the 
atoms within the molecule (and the rotation of the molecule), and in partic- 
ular when a quantised description of the latter is necessary." 

The expression (29.4) is written in terms of Cartesian coordinates in 
“velocity space”. If we change from Cartesian to spherical polar coordi- 
nates, the result is 

im 


3/2 
Zs : —mvt/2Ty2 of 
dw, ( mar) e-m'Ty? sin 9 dO db do, (29.6) 


where v is the absolute magnitude of the velocity, and 6 and ¢ the polar angle 
and azimuthal angle which determine the direction of the velocity. Integra- 
tion with respect to angle gives the probability distribution for the absolute 
magnitude of the velocity: 

m 


3/2 
“dw, = 4e( a7 e— mT y2 dy, (29.7) 


It is sometimes convenient to use cylindrical coordinates in velocity 
space. Then 


8/2 
dw, = (zn) en mostr)/ Ty, dv, dv, d@, (29.8) 


where v, is the velocity component along the z-axis, v, the component per- 
pendicular to that axis, and ¢ the angle which gives the direction of this 
component. 

Let us calculate the mean kinetic energy of an atom. According to the defi- 
nition of the mean, and using (29.5), we find for any Cartesian velocity com- 


t The Maxwellian distribution clearly applies also to the Brownian motion of 
particles suspended in a liquid. 


ea 
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ponent’ 


ve aes \25 [ oe-meer dv, 


The mean value of the kinetic energy of the atom is therefore 37/2. 
We can thus say that the mean kinetic energy of all the particles in the body 
in classical] statistics is always 3N7/2, where N is the total number of parti- 
cles. 

PROBLEMS 


PROBLEM. 1. Find the mean value of the nth power of the absolute magnitude 
of the velocity. 


SOLUTION. Using (29.7), we find 
m \*? ( /2Tyn+2 
vo" = 4n{ —— eet +2 
(=) | e id v 
2 (2T7\"? _(n+3 
=—({—}) I{——}. 
/a\m ( 2 
t For reference we shall give the values of the integrals of the form 


i f eo x dy, 
0 


which often occur in applications of the Maxwellian distribution. The substitution 
ax? = y gives 


ly = dg ™+D/a f ey" D2 dy 
0 


au 4a —™ DAL (Sn +3); 


where I(x) is the gamma function. In particular, if n = 2r with r > 0, then 


_ Qr-)' / 2 
lor = HT aati’ 


where (2r—1)!! = 1-3-5+ ... (2r—1). If 7 = 0, then 
= + f(a). 

If n = 2r+1, then 
teag = rifael 


The same integral from — o to + co is zero if m = 2r7+1 and twice the integral 
from 0 to oo ifm = 2r. 
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In particular, if 2 is even (= 27), then 
vo = (T/my (2r-+1)!!; 
if m = 2r+1, then 
peti = (r+1)!. 


2 OT \ (er +D/2 
(mn) 


a/% 

PROBLEM 2. Find the mean square fluctuation of the velocity. 

SOLUTION. The result of Problem 1 with n = 1 and n = 2 gives 
(doy?) = v2 —0 = (T/m) (3 —8/n). 


PROBLEM 3. Find the mean energy, the mean square energy, and the mean square 
fluctuation of the kinetic energy of an atom. 


SOLUTION. From the results of Problem 1 we find 
&= dmv? = 37/2, 
e? = 1577/4, 
(Ae?) = 37?/2. 
PROBLEM 4. Find the probability distribution for the kinetic energy of an atom. 


SOLUTION. 


2 
de = Ty eT ./e de. 


PROBLEM 5, Find the probability distribution for the angular velocities of rota- 
tion of molecules. 


SOLUTION. Just as for translational motion, we can write the probability distri- 
bution for the rotation of each molecule separately (in classical statistics). The 
kinetic energy of rotation of a molecule regarded as a rigid body (which is permis- 
sible, owing to the smallness of the atomic vibrations within the molecule) is 


1 iy 1 (M2 M2 M2 
frot = Fz APi+ A+ 423) => (+ = aa 


where /,, 1, J, are the principal moments of inertia, {2,, 25, (2, are the components 
of the angular velocity along the principal axes of inertia, and M, = 1,92,, Mz = 
1,925, Mz = 1,22, are the components of the angular momentum, which act as 
generalised momenta with respect to the velocities 2,, 2,, 2,. The normalised 
probability distribution for the angular-momentum components is . 


1 (M2 M2 M2 
dWyyq = (20T)- (hy Ing) ¥? exp le (+5 ae dM, dM, dM; 
1 


and for the angular velocity 


1 
dwo = (2x7) 3? (LJgl,)” exp |- oF 1,02+ 123+ 1,29| dQ, dQ, dQ. 


al 
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PROBLEM 6. Find the mean squares of the absolute magnitudes of the angular 
velocity and angular momentum of a molecule. 


. SOLUTION. The above distributions give 


M* = T(h+h+h). 


§ 30. The probability distribution for an oscillator 


Let us consider a body whose atoms are executing small oscillations about 
some equilibrium positions. They may be atoms in a crystal or in a gas mole- 
cule; in the latter case the motion of the molecule as a whole does not affect 
the oscillations of the atoms within it and so does not influence the results. 

As we know from mechanics, the Hamiltonian (the energy) of a system 
consisting of an arbitrary number of particles executing small oscillations can 
be written as a sum: 


E(p, 9) = ps (p2 + 2g), 


where q, are what are called the normal coordinates of the oscillations (equal 
to zero at points of equilibrium), p, = q¢, are the corresponding generalised 
momenta, and w, are the oscillation frequencies. In other words, E(p, q) is 
a sum of independent terms, each corresponding to a separate normal oscil- 
lation (or, as we say, to an oscillator). In quantum mechanics the same is true 
of the Hamiltonian operator of the system, so that each oscillator is independ- 
ently quantised and the energy levels of the system are given by the sums 


py fito,(na + $)s 


the n, being integers. 

As a result of these facts the Gibbs distribution for the whole system is a 
product of independent factors, each giving the statistical distribution for a 
separate oscillator. In consequence we shall consider a single oscillator in 
what follows. 

Let us determine the probability distribution for the coordinate g of an 
oscillator;' the suffix « which gives the number of the oscillator will be omit- 
ted henceforward. In classical statistics the solution to this problem would be 
very simple: since the potential energy of the oscillator is }.wg?, the probabil- 
ity distribution is 

dw, = Ae~“#PT dg, 


+ The normal coordinate has the dimensions cm-g. 
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or, determining 4 from the normalisation condition, 


@ 


V(2aT) 


the integration with respect to g may be taken from — co to + ©, since the 
intepral is rapidly convergent. 

Let us now consider the solution of this problem in the quantum case. Let 
»,(@) be the wave functions of the stationary states of the oscillator, corre- 
sponding to the energy levels 


eat dg; (30.1) 


dw, = 


En = hw(n+4). 


If the oscillator is in the nth state, the quantum probability distribution 
for its coordinate is given by y? (in the present case the functions yp, are real, 
and so we write simply ? instead of the squared modulus |, |"). The re- 
quired statistical probability distribution is obtained by multiplying ? by the 
probability w, of finding the oscillator in the nth state, and then summing over 
all possible states. 

According to the Gibbs distribution, 


Ww, = ae~wT , 


where a is a constant. Thus we have the formula 
dw, = adg ¥ ey, (30.2) 
nad 


which is, of course, entirely in agreement with the general formula (5.8). 
To calculate the sum, we can proceed as follows. With the notation dw, = 
e, dg, we form the derivative 


de, = dy, 
sigan: ee 2a —4,/T, nz 
dq one dq 

Using the momentum operator f = if'd/dg and the fact that the oscillator 
momentum has non-zero matrix elements (see/ Quantum Mechanics, § 23) 
only for transitions with n - n+1, we can write 


dy, i a 
dq H Pe 


i 


(Pn—1, nPn—1 + Pu+1, nPn+1) 


h 
@ 
a (Qn— nPn—1 — Fn+1, nPn+1)- 
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Here we have used the relations 
Pna-1,2 = —10Gn—1, ns Patina = 10Gn+1,n 
between the momentum and coordinate matrix elements. Thus 


= cat {3 Qn—1, nPntPn—1e wT — Danes nPripn3e— oF \ 
In the first sum we change the summation suffix from n to n+ 1 and use the 
relations 
Ent = En th, Qntiun = Inn+ts 9-1,0 = 0, 
obtaining 
dQq 2aw 


‘dq he (es) 2, In, n+iPripnzie—w? 
In an exactly similar manner we can prove that 
90, = al +e—*T) 2, Jn, n+1PnPng1e—™" 
az 


A comparison of the two equations gives 


whence 


= Bae ho 
Qq constant exp { Ts tanh cat 


Determining the constant from the normalisation condition, we finally obtain 
the formula 


1/2 
dw, = (5 tanh 7) exp {—d = tanh al dg (30.3) 


(F. Bloch, 1932). Thus in the quantum case also the probabilities of various 
values of the coordinate of an oscillator are distributed according to a law 
of the form exp (—«g”), but the coefficient « differs from that in the classical 
case. In the limit Aw « T, where the quantisation is unimportant, formula 
(30.3) becomes (30.1), as we should expect. 

In the opposite limiting case fw >> T, formula (30.3) becomes 


dwg = — exp (—q’w/h) dq, 
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i.e. the purely quantum probability distribution for the coordinate in the 
ground state of the oscillator.’ This corresponds to the fact that when 
T < hw the oscillations are hardly excited at all. 

The probability distribution for the momentum of the oscillator can be 
written by analogy with (30.3) without repeating the calculation. The reason 
is that the problem of quantisation of the oscillator is completely symmetrical 
as regards the coordinate and the momentum, and the oscillator wave func- 
tions in the p representation are the same as its ordinary coordinate wave 
functions (g being replaced by p/w; see Quantum Mechanics, § 23, Problem 1. 
The required distribution is therefore 


a 
dw, = (ses tanh 7) exp | -F tanh xr dp. (30.4) 


In the limit of classical mechanics (fm < T) this becomes the usual Maxwel- 
lian distribution: 
dw, = (22T)—"e-P°PT dp. (30.5) 


PROBLEM 
Determine the coordinate density matrix for a harmonic oscillator. 
SOLUTION. The coordinate density matrix for an oscillator in statistical equilib- 
rium is 


04,9) =a ye y4(@’) Wa(Q); 


see the footnote following (5.4). We put qg = r+s, q’ = r—s, and calculate the 
derivative (00/0s),. As in the similar calculation above, we find 


= ~ (1-+e—*/T) oy Qn, n+1lPn41(@) om | —v,(D Wy 41D). 


On calculating 50 = 4q—q)o i in the same way and comparing it with the deriva- 
tive, we obtain . 


_ @o/Os), = —so(20/) coth (foo/27), 
whence 


fio 
o@, q’)) = A(r) exp (-#5 age coth zr): 


t This is the squared modulus of the wave function of the ground state of the 
oscillator. 
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The function A(r) is determined by the condition that for s = 0, i.e. g= q’ = 71, 
the “diagonal elements” 0(q, g) of the density matrix agree with (30.3). The final 
result is 


_ (@ fio \¥®@ ¢ watdy?, fw wq-aP hw 
oq, qg) = es tanb >) om mae ae tanh = — gy coth at 


§ 31. The free energy in the Gibbs distribution 


According to formula (7.9) the entropy of a body can be calculated as the 
mean logarithm of its distribution function: 


S = —(log wx). 


Substituting the Gibbs distribution (28.3) gives 
S = —log A+E£/T, 


whence log A = (E—TS)T. But the mean energy E is just what is meant by 
the term “energy” in thermodynamics; hence E—TS = F and log A = F/T, 
i.e. the normalisation constant of the distribution is directly related to the free 
energy of the body. 

_ Thus the Gibbs distribution may be written in the form 


Ww, = eF-EaT, (31.1) 


and this is the form most frequently used. The same method gives in the clas- 
sical case, using (7.12), the expression . 


0 = (2nh)-5 elF-Ee. OV, (31.2) 


The normalisation condition for the distribution (31.1) is 
wa = eFTS e-EvT = 1, 


or ; 
F =—T log Y eT, (31.3) 


This formula is fundamental in thermodynamic applications of the Gibbs 
distribution. It affords, in principle, the possibility of calculating the thermo- 
dynamic functions for any body whose energy spectrum is known. 

The sum in the logarithm in (31.3) is usually called the partition function 
(or sum over states). It is just the trace of the operator exp (— A/T), where 
is the Hamiltonian of the body:* 


Z = Y,e—eT = tr exp (—A/T). (31.4) 


t In accordance with the general rules, exp (— 7/7) denotes an operator whose 
eigenfunctions are the same as those of the operator W and whose eigenvalues are 
e~=n/T, 
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This notation has the advantage that any complete set of wave functions may 
be used in order to calculate the trace. 

A similar formula in classical statistics is obtained from the normalisation 
condition for the distribution (31.2). First of all, however, we must take ac- 
count of the following fact, which was unimportant so long as we were discuss- 
ing the distribution function as such and not relating the normalisation 
coefficient to a particular quantitative property of the body, viz. its free 
energy. If, for example, two identical atoms change places, then afterwards 
the microstate of the body is represented by a different phase point, obtained 
from the original one by replacing the coordinates and momenta of one atom 
by those of the other. On the other hand, since the interchanged atoms are 
identical, the two states of the body are physically identical. Thus a number of 
points in phase space correspond to one physical microstate of the body. 
In integrating the distribution (31.2), however, each state must of course be 
taken only once.' In other words, we must integrate only over those regions 
of phase space which correspond to physically different states of the body. 
This will be denoted by a prime to the integral sign. 

Thus we have the formula 


F = —T log f e~ Ep. MIT dT’; (31.5) 


here and in all similar cases below, dJ’ denotes the volume element in phase 
Space divided by (2xh)’: 
dI° = dp dq/(2zh):. (31.6) 


Thus the partition function in the quantum formula (31.3) becomes an inte- 
gral over states. As already mentioned in § 29, the classical energy E(p, q) can 
always be written as the sum of the kinetic energy K(p) and the potential 
energy U(q). The kinetic energy is a quadratic function of the momenta, and 
the integration with respect to the latter can be effected in a general form. 
The problem of calculating the partition function therefore actually reduces 
to that of integrating the function e~¥'" with respect to the coordinates. 

In the practical calculation of the partition function it is usually convenient 
to extend the region of integration and include an appropriate correction 


t This becomes particularly evident if we consider the classical partition function 
(integral over states) as the limit of the quantum partition function. In the latter 
the summation is over all the different quantum states, and there is no problem 
(remembering that, because of the principle of symmetry of wave functions in 
quantum mechanics, the quantum state is unaffected by interchanges of identical 
particles), 

From the purely classical viewpoint the need for this interpretation of the statis- 
tical integration arises because otherwise the statistical weight would no longer 
be multiplicative, and so the entropy and the other thermodynamic ahennuce 
would no longer be additive. 
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factor. For example, let us consider a gas of N identical atoms. Then we can 
integrate with respect to the coordinates of each atom separately, extending 
the integration over the whole volume occupied by the gas; but the result 
must be divided by the number of possible permutations of N atoms, which 
is N!. In other words, the integral f " can be replaced by the integral over all 
phase space, divided by N!: 


: 1 
[ar = ay [ar (31.7) 
Similarly, it is convenient to extend the region of integration for a gas 
consisting of N identical molecules: the integration with respect to the co- 
ordinates of each molecule as a whole (i.e. the coordinates of its centre of 
mass) is carried out independently over the whole volume, whilst that with 
respect to the coordinates of the atoms within the molecule is carried out 
over the “volume” belonging to each molecule (i.e. over a small region in 
which there is an appreciable probability of finding the atoms forming the 
molecule). Then the integral must again be divided by M!. 


PROBLEMS 


PROBLEM 1. The potential energy of the interaction between the particles in a 
body is a homogeneous function of degree n in their coordinates. Using similarity 
arguments, determine the form of the free energy of such a body in classical sta- 
tistics. 


SoLuTION. In the partition function 
Z= J ’ e-IK@)+U@IT G7, 


we replace each q by Ag and each p by A"*p, where J is an arbitrary constant. If at 
the same time we replace T by A"7, the integrand is unchanged, but the limits of 
integration with respect to the coordinates are altered: the linear size of the region 
of integration is multiplied by 1/A, and so the volume is multiplied by 1/4. In 
order to restore the limits of integration, we must therefore at the same time replace 
V by A8V. The result of these changes is to multiply the integral by 1°NG+#/) 
because of the change of variables in dl" (s = 3N coordinates and the same number 
of momenta, N being the number of particles in the body). Thus we conclude that 
the substitutions V - A8V, T + A"T give 


Z —» J38NA+N/2)zZ, 


The most general form of function Z(V, 7) having this property is 


Z= T3NG2+ Ya) ¢( VT —3/"), 


where f is an arbitrary function of one variable. 
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Hence we find for the free energy an expression of the form 
F=—3($+1/n)NT log T+NTO(VT—*"/N), (1) 


which involves only one unknown function of one variable; the number N is includ- 
ed in the second term in (1) so that F shall have the necessary property of additiv- 
ity. 

PROBLEM 2. Derive the virial theorem for a macroscopic body for which the 
potential energy of interaction of the particles is a homogeneous function of degree 
n in their coordinates. 


SOLUTION. Following the derivation of the virial theorem in mechanics (see 
Mechanics, § 10), we calculate the time derivative of the sum Lrep, where r and p 
are the radius vectors and momenta of the particlesin the body. Since r = OK(p)/Op 
and K(p) is a homogeneous function of degree two in the momenta, we have 


OK(p) 


© Seep = Eps By teed = 2K) +Eeep. 


The particles in the body execute a motion in a finite region of space with velocities 
which do not become infinite. The quantity 2r-p is therefore bounded and the 
mean value of its time derivative is zero, so that 


2K+ (Strep) = 0, 


where K = K(p). The derivatives p are determined by the forces acting on the 
particles in the body. In summing over all particles we must take into account not 
only the forces of interaction between the particles but also the forces exerted on 
the surface of the body by surrounding bodies: 


(rep) = -(r a a b redf = —nU—3PV; 


the surface integral is transformed to a volume integral and we use the fact that 
div r = 3. Thus we have 2K—nU—3PV = 0 or, in terms of the total energy E = 
U+K, 

(n+2)K = nE+3PV. (2) 


This is the required theorem. It is valid in both classical and quantum theory. 
In the classical case, the mean kinetic energy K = 3NT7/2, and (2) gives 

E+(3/n)PV = 3($+1/n)NT. (3) 

This formula could also be derived from the expression (1) for the free energy 


(Problem 1). 
When the particles interact by Coulomb’s law (n =—1), we have from (2) 


K=—E+3PY. 


This is the limiting case of the relativistic relation 


E-3PV = Emc®/(i—v4/c), 


in which the energy E includes the rest energy of the particles in the body; see 
Fields,§ 35. 
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§ 32. Thermodynamic perturbation theory 


In the actual calculation of thermodynamic quantities there occur cases 
where the energy E(p, g) of a body contains relatively small terms which may 
be neglected to a first approximation. These may be, for instance, the poten- 
tial energy of the particles of the body in an external field. The conditions 
under which such terms may be regarded as small are discussed below. 

In these cases a kind of “perturbation theory” may be employed for the 
calculation of the thermodynamic quantities (R. E. Peierls, 1932). We shall 
first show how this is to be done when the classical Gibbs distribution is 
applicable. 

We write the energy E(p, g) in the form 


E(p, q) = Eo(p, 9) +V (p, 9), (32.1) 


where V represents the small terms. To calculate the free energy of the body, 
we put 
en FIT = | e7lFo(p.9)+ Vip, DVT QI" 


{ ear(i-7+ x) ar: (32.2) 


in the expansion in powers of V we shall always omit terms above the second 
order, in order to calculate the corrections only to the first and second orders 
of approximation. Taking logarithms and again expanding in series, we have 
to the same accuracy 


F= Fo+| (v- zp) noon ar 
= [ Fo~Eo(p, VT 1 
+55 ll Vel ovr ar 1 


where Fo denotes the “unperturbed” free energy, calculated for V = 0. 
The resulting integrals are the mean values of the corresponding quanti- 
ties over the “unperturbed” Gibbs distribution. Taking the averaging in 


this sense and noticing that V2—7? = ((V—V), we have finally 
Pecks 7-5 (W- Vy). (32.3) 


Thus the first-order correction to the free energy is just the mean value of 
the energy perturbation V. The second-order correction is always negative, 
and is determined by the mean square of the deviation of V from its mean 
value. In particular, if the mean value V is zero, the perturbation reduces 
the free energy. 
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A comparison of the terms of the second and first orders in (32.3) enables 
us to ascertain the condition for this perturbation method to be applicable. 
Here it must be remembered that both the mean value V and the mean 
square ((V—V)*) are roughly proportional to the number of particles; cf. 
the discussion in § 2 concerning r.m.s. fluctuations of the thermodynamic 
quantities for macroscopic bodies. We can therefore formulate the desired 
condition by requiring that the perturbation energy per particle should be 
small in comparison with 7.* 

Let us now carry out the corresponding calculations for the quantum case. 
Instead of (32.1) we must now use the analogous expression for the Hamilto- 
nian operator: 


H= Ao+P. 


According to the quantum perturbation theory (see Quantum Mechanics, 
§ 38), the energy levels of the perturbed system are given, correct to the 


second-order terms, by 


, V nin 
E, = ED +Van +>. ae (32.4) 


where the E are the unperturbed energy levels (assumed non-degenerate); 
the prime to the sum signifies that the term with m = n must be omitted. 
This expression is to be substituted in the formula 
d 


and expanded in the same way as above. We thus easily obtain 


t |Vam |? Wn 
F= Fo+ YVimWn +d) “EO-EO 


l 1 2 
_ oT ps ViWn + oT 2 V aot) ’ (32.5) 


where w,, = exp [(Fyp—E)/T] is the unperturbed Gibbs distribution. 
The diagonal matrix element V,,, is just the mean value of the perturbation 
energy V in the given (mth) quantum state. The sum 


Y Van W,= Pun 


is therefore the value of V averaged both over the quantum state of the body 
and over the (unperturbed) statistical distribution with respect to the vari- 


t In expanding the integrand in (32.2) we have, strictly speaking, expanded in 
terms of a quantity V/7, which is proportional to the number of particles and is 
therefore certainly not small, but the further expansion of the logarithm causes 
the large terms to cancel, and so a series in powers of a small quantity is obtained. 
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ous quantum states. This value gives the correction to the free energy in the 
first-order approximation, formally the same as the classical result above. 
Formula (32.5) may be rewritten as 


1 , Vn |? (Win Wn) 1 
F = Fo+P nn ae 8 2 » ~~ £O-EO == yal (Van— Pun: (32.6) 


All the second-order terms in this expression are negative, since w,,—w,, has 
the same sign as E©—EZ®. Thus the correction to the free energy in the 
second-order approximation is negative in the quantum case also. 

As in the classical case, the condition for this method to be applicable is 
that the perturbation energy per particle should be small compared with T. 
On the other hand, the condition for the applicability of the ordinary quan- 
tum perturbation theory (leading to the expression (32.4) for E,) is, as we 
know, that the matrix elements of the perturbation should be small compared 
with the separations of the corresponding energy levels; roughly speaking, 
the perturbation energy must be small compared with the separations of the 
energy levels between which allowed transitions can take place.’ 

These two conditions are not the same, since the temperature is unrelated 
to the energy levels of the body. It may happen that the perturbation energy 
is small compared with 7, but is not small, or indeed is even large, compared 
with the significant separations between energy levels. In such cases the “per- 
turbation theory” for thermodynamic quantities, i.e. formula (32.6), will be 
applicable while the perturbation theory for the energy levels themselves, i.e. 
formula (32.4), is not; that is, the limits of convergence of the expansion rep- 
resented by formula (32.6) may be wider than those of (32.4), from which 
the former expansion has been derived. 

The converse case is, of course, also possible (at sufficiently low tempera- 
tures). 

Formula (32.6) is considerably simplified if not only the perturbation 
energy but also the differences between energy levels are small in comparison 
with T. Expanding the difference w,,—w, in (32.6) in powers of (EO —E)/T, 
we find in this case 


F= F + Pan- + Ir (I Vam |?) - (Van Pu). 


The rule of matrix multiplication gives 


Da [Vnm|? + Vin =A 2 | Pam|? = 2 Vm Vn = Pies 


t These are in general the transitions in which the states of only a small number 
of particles in the body are changed. 


gt 
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and we obtain an expression which is formally exactly the same as formula 
(32.3). Thus in this case the quantum formula is in formal agreement with the 
classical formula.t 


§ 33, Expansion in powers of 4 


Formula (31.5) is essentially the first and principal term in an expansion of 
the quantum formula (31.3) for the free energy in powers of # in the quasi- 
classical case. It is of considerable interest to derive the next non-vanishing 
term in this expansion (E. P. Wigner, G. E. Uhlenbeck and L. Gropper, 
1932). 

The problem of calculating the free energy amounts to that of calculating 
the partition function. For this purpose we use the fact that the latter is the 
trace of the operator exp (—8H) (see (31.4)), with the notation 8 = 1/T in 
order to simplify the writing of the involved expressions. The trace of an 
operator may be calculated by means of any complete set of orthonormal 
wave functions. For these it is convenient to use the wave functions of free 
motion of a system of N non-interacting particles in a large but finite volume 
V. These functions are 


vp = ay exp |) py pail; (33.1) 


where the q, are the Cartesian coordinates of the particles and the p, the 
corresponding momenta, labelled by the suffix i, which takes the values 
1,2, ...,5, where s = 3N is the number of degrees of freedom of the system 
of N particles. 

The subsequent calculations apply equally to systems containing identical 
particles (atoms) and to those where the particles are different. In order to 
allow in a general manner for a possible difference between the particles, we 
shall add to the particle mass a suffix indicating the degree of freedom: m,. 
Of course the three m, corresponding to any one particle are always equal. 

The existence of identical particles in a body means that, in the quantum 
theory, exchange effects must be taken into account. This means, first of all, 
that the wave functions (33.1) must be made symmetrical or antisymmetrical 
in the particle coordinates, depending on the statistics obeyed by the parti- 
cles. It is found, however, that this effect leads only to exponentially small 
terms in the free energy, and so is of no interest. Secondly, the identity of 
particles in quantum mechanics affects the manner in which the summation 


t The more powerful methods of the “diagram technique”, allowing the whole 
perturbation theory series to be considered for thermodynamic quantities, will be 
described in Part 2. 
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over different values of the particle momenta must be carried out. We shall 
meet this later, for example in calculating partition functions for an ideal 
quantum gas. The effect produces a term of the third order in # in the free 
energy (as shown later) and so again does not affect the terms of order fi? 
which we shall calculate here. Thus the exchange effects can be ignored in 
the calculation. 

- In each of the wave functions (33.1) the momenta p, have definite constant 
values. The possible values of each p, form a dense discrete set (the distances 
between neighbouring values being inversely proportional to the linear 
dimensions of the volume occupied by the system). The summation of the 
matrix elements exp (—BH )yp With respect to all possible values of the 
momenta may therefore be replaced by integration with respect to 
p (dp = dp, dp, ... dp,), bearing in mind that the number of quantum 
states “belonging” to the volume V™ dp of phase space (all values of the 
coordinates of each particle in the volume V and values of the momenta 
in dp) is . 

VN dp/(2ahy . 


We shall use the notation 
1 = exp | G/M). paul exp (~BA) exp | rah (33.2) 


The required matrix elements are obtained by integrating with respect to all 
the coordinates: 


exp (—B Np = <r ral Tag. (33.3) 


The partition function is then obtained by integration with respect to the 
momenta. 

Altogether, therefore, we must integrate J over all phase space, or more 
precisely over those of its regions which correspond to physically different 
states of the body, as explained in § 31. This is again denoted by a prime to 
the integral sign: 


Z = YeFhe = [Id (33.4) 


Let us first calculate J by means of the following procedure. We take the 
derivative : 


3 = —exp [—(i/M)Zp.q] Afexp (i/)Zp.qi}, 
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the operator H] acting on all factors to its right, and expand the right-hand 
side, using the explicit expression for the Hamiltonian of the body: 


n= rye ne ee sh LS atu, (33.5) 


where U = U(q, do, .--» 9,) is the potential energy of interaction between 
all particles in the body. By means of (33.5) we obtain after a straightforward 
calculation the following equation for /: 


ol AR /2i Ol OF 
ag = EP M+ Sa (Fp Pigg +3q) 


where 


2 
E(p, 9) = ¥, a +U (33.6) 


is the usual classical expression for the energy of the body. 
This equation is to be solved with the obvious condition that J = 1 when 
B = O. The substitution 


T = e7 PEP. Dy (33.7) 
gives 
- 7h [_ dbp ou +2P Eh _ By OUT gta { 22 
= Yom, [- h 0q,*"' hh OG t(S4, -)' 


Cy OU d% 
Bal Ei eetaciire wea 33.8 
0q; 94; 247 | G2) 


with the boundary condition x = 1 for 8 = 0. 
In order to obtain an expansion in powers of fi, we solve equation (33.8) 
by successive approximations, putting 


= 1th thet... (33.9) 


with y1 = 0, y2 = 0, ... for 8 = 0. Substituting this expansion in equation 
(33.8) and separating terms in different powers of #, we obtain the equations 


OX , p; OU 
ap Pn Bai” 


a ° 0U OX o2U oud 
=> —~ 2iBp; —— 2ip; = — . 
= Sag - Bp, 0g; Mit <p; 04; B oq? +55) |: 
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The first equation gives y,, and then the second equation gives 7. A simple 
calculation leads to the results 


B52 eas p; OU 
i= 2 ie Og’ 
rt Pi a Pi Pr fold Of 

hea ¢ ( m; 3a) +% BD im mi me 0q:99% ooo) 

1 eu 

mG a? Lan, lags) a P dm m, 6g? * 

The required partition function (33.4) is 

Z = [ (lth, +H y)e-PEe.® aT’, (33.11) 


The term of the first order in / in this integral is easily seen to be zero, since 
the integrand y,e~°* in that term is an odd function of the momenta 
(E(p, 9) being quadratic in the momenta and 41, by (33.10), linear), and so 
the result on integrating with respect to momenta is zero. Thus we can write 
(33.11) as 

Z = (14+#%) [ e-PEO.O AT, 


where 72 is the value of y2 averaged over the classical Gibbs distribution: 


a f Age PEP, @) ar 
he = [ e-PH.0 ar 


Substituting this expression for the partition function in formula (31.3), we 
have for the free energy 


= Fa-G log (1 + A74,), 


or, to the same accuracy, 
F= Fy—#4,/p. (33.12) 


Here F,, is the free energy in classical statistics (formula (31.5)). 

Thus the next term after the classical expression in the expansion of the 
free energy is of the second order in #. This is not accidental: in equation 
(33.8), solved here by the method of successive approximations, the quantum 
constant appears only as if, and so the resulting expansion is one in powers 
of if; but the free energy, being a real quantity, can contain only powers of 
ih which are real. Thus this expansion of the free energy (ignoring exchange 
effects) is an expansion in even powers of 4h. 

It remains to calculate the mean value y%2. We have seen in § 29 that in 
classical statistics the probability distributions for the coordinates and 
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momenta are independent. The averaging over momenta and over coordinates 
can therefore be made separately. 

The mean value of the product of two different momenta is clearly zero. 
The mean value of the square p? is m,/B. We can therefore write 


(PiPry = (7i/B)Oix» 


where 6, = 1 for i= k and 0 fori # k. Having averaged with respect to 
momenta by means of this formula, we obtain 


2 A277 
= Fim (Bey) Em Be OO 
The two terms here may be combined, since the mean values are related by 
the formula __ ee 
“ae = os) . (33.14) 
This is easily seen by noticing that 
a e—FU dq; = ane’ +8 | (Sa) * e-PY aq. 


The first term on the right-hand side gives only a surface effect, and since 
the body is macroscopic this effect may be neglected in comparison with 
the second term. 

Substituting the resulting expression for 72 in formula (33.12), and replac- 
ing B by 1/T, we find the following final expression for the free energy: 


(=). (33.15) 


a Fatagra en (= 


We see that the correction to the classical value is always positive, and is 
determined by the mean squares of the forces acting on the particles. This 
correction decreases with increasing particle mass and increasing tempera- 
ture. 

According to the above discussion, the next term in the expansion given 
here would be of the fourth order. This enables us to calculate quite inde- 
pendently the term of order f® which occurs in the free energy because of the 
peculiarities of the summation over momenta resulting from the identity 
of particles in quantum mechanics. The term in question is formally the same 
as the correction term which appears in a similar calculation for an ideal 
gas, and is given by formula (56.14): | 


nS NS 


8) = ar 
FO = 45 pple (33.16) 
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for a body consisting of N identical particles. The upper sign applies for 
Fermi statistics and the lower sign for Bose statistics; g is the total degree 
of degeneracy with respect to the directions of the electron and nuclear 
angular momenta. 

From these formulae we can also obtain the correction terms in the proba- 
bility distribution functions for the coordinates and momenta of the atoms 
of the body. According to the general results in§ 5, the momentum probabil- 
ity distribution is given by the integral of J with respect to g (see (5.10)): 


dw, = constant x dp f{ I dg. 
The term y1e~°® jn I contains a total derivative with respect to the co- 


ordinates, and the integral of it gives a surface effect which can be neglected. 
Thus we have 


dw, = constantX exp (- Bye #/2m) dp [ (1+ #y,)e7P? dg. 


The third and fourth terms in the expression (33.10) for y2 give a small con- 
stant (not involving the momenta) on integration, and this can be neglected 
in the same approximation. Taking out also the factor [ e~°" dg and includ- 
ing it in the constant coefficient, we have 


OU ou 


dw, = constantx (- 2m) jive 5 a 
nstant x exp | —B )’ p#/2m, peas Fe OG) Oak 
B Pipe CU ee ap 
2 
2s 6% y mim, 09: 09x 
The mean values which appear here are related by 
eu  ,0u aU 
09: 04% 0g: 89K” 
similarly to (33.14). Hence 
= or . PiPx OU 3U Uo) 
OM acai ( B 2 Pi [2m) 1458 a) y mim, Og: 8 . iVk 
(33.17) 


This expression can be conveniently rewritten in the following final form: 


1 
dw, = constant X exp {- r| fi ~s EE PE. Ogi oe ou} se 
(33.18) 


the bracket in (33.17) being replaced by an exponential function to the same 
degree of accuracy. 
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Thus we see that the correction to the classical distribution function for 
the momenta is equivalent to adding to the kinetic energy in the exponent an 
expression quadratic in the momenta, with coefficients depending on the law 
of interaction between the particles in the body. 

If it is desired to find the probability distribution for any one momentum 
P; then (33.17) must be integrated with respect to all the other momenta. All 
the terms involving the squares p? (k + 7) will then give constants negligible 
compared with unity, while the terms containing products of different 
momenta give zero. The result is, again in exponential form, 


2 2 7Tau\! 
dw,, = constant exp | ee 1 — aT m, (53.) i dp;. (33.19) 


Thus the distribution obtained differs from the Maxwellian only in that the 
true temperature T is replaced by a somewhat higher “effective tempera- 


ture”: 
hi? ou \? 
Ten = T+ 57%, (= 


Similarly we can calculate the corrected coordinate distribution function 
by integrating 7 with respect to the momenta: 


dw, = constant X dg f I dp. 


The same calculations as led to (33.13) give 


1 aU 1 &U 
dw, = constant x exp {-7 |v — ar ae (5,,) +B 127 ie Og cat ee 
(33.20) 


§ 34. The Gibbs distribution for rotating bodies 


The problem of the thermodynamic relations for rotating bodies has already 
been considered in § 26. Let us now see how the Gibbs distribution is to be 
formulated for rotating bodies. This will complete the investigation of their 
statistical properties. As regards the uniform translational motion, Galileo’s 
relativity principle shows that, as already mentioned in § 26, this motion 
has only a trivial effect on the statistical properties and so needs no special 
consideration. 

In a system of coordinates rotating with the body, the usual Gibbs distri- 
bution is valid; in classical statistics, 


o = (2nfi)—* FE, aT , (34.1) 
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where E’(p, q) is the energy of the body in this system, as a function of the 
coordinates and momenta of its particles, and F’ the free energy in the same 
system (which, of course, is not the same as the free energy of the body when 
at rest). The energy E’(p, g) is related to the energy E(p, g) in a fixed system 
by 

E’(p, g) = E(p, q)— 2-M(p, 9), (34.2) 


where $2 is the angular velocity of rotation and M(p, g) the angular momen- 
tum of the body (see § 26). Substituting (34.2) in (34.1), we find the Gibbs 
distribution for a rotating body in the form’ 

= (Qnh)-! fF’ -E0. )+0-M(, OUT , (34,3) 


In classical statistics the Gibbs distribution for a rotating body can also 
be represented in another form. To obtain this, we use the following expres- 
sion for the energy of the body in the rotating coordinate system: 


E’ = L$mv2—§ Lm(QXr}P?+ U, (34.4) 


where the v’ are the velocities of the particles relative to the rotating system, 
and the r their radius vectors (see Mechanics, § 39). Denoting by 


Ed(v,r) = 2 $mv2+U | (34.5) 


the part of the energy which is independent of 2, we obtain the Gibbs distri- 
bution in the form 


0 = (Qnfi)-*exp {7 [r- Edy’, +z Zmaxey 


The function @ determines the probability corresponding to the element 
of phase space dx, dy, dz, ... dp}, dpi, dp}, ..., where p’ = mv’ +mQxr 
(see Mechanics, § 39). Since, in obtaining the differentials of the momenta, 
we must regard the coordinates as constant, dp’ = m dv’, and the probabil- 
ity distribution expressed in terms of the coordinates and velocities of the 
particle is 


z Bee Nip 1 : 
dw = Cexp iar EG ; r)— 25 m(2xr) |} 
Xdx, dy, dz, ... dvi, dry, dey, ...; (34.6) 


where C denotes for brevity the factor (2xf)~* together with the product of 
the particle masses which appears when we go from the momentum differen- 
tials to the velocity differentials. 


t The distribution (34.3), like the ordinary Gibbs distribution, is fully in agree- 
ment with the result (4.2) derived in § 4 from Liouville’s theorem: the logarithm 
of the distribution function is a linear function of the energy and angular momen- 
tum of the body. 
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For a body at rest we have 
dw = CelF-4oW. VT dx, dy1 dz, ... dv, dv, dvj4,..., (34.7) 


with the same expression (34.5) for Eo(v, r), now a function of the velocities in 
the fixed coordinate system. Thus we see that the Gibbs distribution for the 
coordinates and velocities for a rotating body differs from that for a body at 
rest only by the additional potential energy —$2m(2Qxry. In other words, 
as regards the statistical properties of the body, the rotation is equivalent to 
the existence of an external field corresponding to the centrifugal force. The 
statistical properties are not affected by the Coriolis force. 

It should be emphasised, however, that this last result applies only to 
classical statistics. In the quantum case the expression 


= exp[(F’-H+2.M)/T] (34.8) 


gives the statistical operator corresponding to (34.3) for a rotating body. 
Formally we can reduce this operator to a form analogous to (34.6), the 
velocities v’ being replaced by the operators ¥ = p’/m—Q2xr, but the com- 
ponents of this vector operator do not commute, unlike those of the operator 
? in the fixed system. The statistical operators corresponding to the expres- 
sions (34.6) and (34.7) will therefore in general be markedly different from each 
other, quite apart from the fact that one of them contains the centrifugal 
energy. 


§ 35. The Gibbs distribution for a variable number of particles 


So far we have always tacitly assumed that the number of particles in a body 
is some given constant, and have deliberately passed over the fact that in 
reality particles may be exchanged between different subsystems. In other 
words, the number N of particles in a subsystem will necessarily fluctuate 
about its mean value. In order to formulate precisely what we mean by the 
number of particles, we shall use the term subsystem to refer to a part of the 
system which is enclosed in a fixed volume. Then N will denote the number of 
particles within that volume.‘ 

‘Thus the problem arises of generalising the Gibbs distribution to bodies 
with.a variable number of particles. Here we shall write the formulae for 
bodies consisting of identical particles; the further generalisation to sys- 
tems containing different particles is obvious (§ 85). 


t In deriving the Gibbs distribution in§ 28 we have in essence already understood 
subsystems in this sense; in going from (28.2) to (28.3) we differentiated the entropy 
whilst regarding the volume of the body (and therefore of the medium) as constant. 
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The distribution function now depends not only on the energy of the quan- 
tum state but also on the number N of particles in the body, and the energy 
levels Ey are of course themselves different for different N (as indicated by 
the suffix N). The probability that the body contains N particles and is in the 
nth state will be denoted by w,y. 

The form of this function can be determined in exactly the same way as the 
function w, in § 28. The only difference is that the entropy of the medium is 
now a function not only of its energy E’ but also of the number N’ of particles 
in it: S’ = S’(E’, N’). Writing E’ = E —E,, and N’ = N—N (where N is 
the number of particles in the body, and N the given total number of par- 
ticles in the entire closed system, which is large compared with N), we have in 
accordance with (28.2) 


Wan = constant X exp {S’(ZO— a N—N)}\; 


the quantity JE’ is regarded as constant, as in § 28. 
Next, we expand S’ in powers of E,, and N, again taking only the linear 
terms. Equation (24.5), in the form 
dE P 


dS = —-+,, dV— 


pre e™ 


shows that (0S/0E), y = 1/T, (0S/ON), » = —u/T. Hence 


S’(E—E,n, NO—N) = S(E®, Neo — Fav , HY ; 


the chemical potential u (and the temperature) being the same for the body 
and the medium, from the conditions of equilibrium. 
Thus we obtain for the distribution function the expression 


Wan = AcN—-Eny)IT | (35.1) 
The normalisation constant A can be expressed in terms of the thermody- 
namic quantities in the same way as in§ 31. The entropy of the body is 


N E 
S = —(log Wan) = —log A-F +7 


and so 

T log A = E—TS—wuN. 
But £—7S = F, and the difference F— uN is the thermodynamic potential Q. 
Thus T log A = Q, and (35.1) may be rewritten as 


Wan = CO+UN—Egn)IT , (35.2) 
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This is the final formula for the Gibbs distribution for a variable number of 
particles.* 

The normalisation condition for the distribution (35.2) requires that the 
result of summing the w,, first over all quantum states (for a given N) and 
then over all values of N should be equal to unity: 


YL Way = eS (ory eva) = 1. 
n n 
Hence we obtain the following expression for the thermodynamic potential 2: 


Q = —Tlog Y jew y emer. (35.3) 


This formula together with (31.3) can be used to calculate the thermodynamic 
quantities for specific bodies. Formula (31.3) gives the free energy of the body 
as a function of T, N and V, and (35.3) gives the potential 2 as a function of 
T, wand V. 

In classical statistics the probability distribution has the form 


dwy = ON dp™ dq™ : 
where 
On = (nhi)~s el@+uN-Eny, QT , (35.4) 
The variable N is written as a subscript to the distribution function, and the 
same letter is written as a superscript to the element of phase volume in order 
to emphasise that a different phase space (of 2s dimensions) corresponds to 
each value of N. The formula for 2 correspondingly becomes 


Q = —T log » euNir f* ew En(P, DIT a. (35.5) 


Finally, we may say a few words concerning the relation between the Gibbs 
distribution (35.2) for a variable number of particles derived here and the 
previous distribution (31.1). First of all, it is clear that, for the determination 
of all the statistical properties of the body except the fluctuations of the total 
number of particles in it, these two distributions are entirely equivalent. On 
neglecting the fluctuations of the number N, we obtain 2+ uN = F, and the 
distribution (35.2) is identical with (31.1). 

The relation between the distributions (31.1) and (35.2) is to a certain 
extent analogous to that between the microcanonical and canonical distri- 
butions. The description of a subsystem by means of the microcanonical 
distribution is equivalent to neglecting the fluctuations of its total energy; the 
canonical distribution in its usual form (31.1) takes into account these fluc- 


t This distribution is sometimes called the grand canonical ensemble. 
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tuations. The latter form in turn neglects the fluctuations in the number of 
particles, and may be said to be “microcanonical with respect to the number 
of particles”; the distribution (35.2) is “canonical” with respect to both the 
energy and the number of particles. 

Thus all three distributions, the microcanonical and the two forms of the 
Gibbs distribution, are in principle suitable for determining the thermo- 
dynamic properties of the body. The only difference from this point of view 
lies in the degree of mathematical convenience. In practice the microcanon- 
ical distribution is the least convenient and is never used for this purpose. 
The Gibbs distribution for a variable number of particles is usually the most 
convenient. 


§ 36. The derivation of the thermodynamic relations from the Gibbs 
distribution 


The Gibbs distribution plays a fundamental part throughout statistical phys- 
ics. We shall therefore give here another justification of it. This distribution 
has essentially been derived in §§ 4 and 6 directly from Liouville’s theorem. 
We have seen that the application of Liouville’s theorem (together with 
considerations of the multiplicativity of distribution functions for subsys- 
tems) enables us to deduce that the logarithm of the distribution function of a 
subsystem must be a linear function of its energy: 


log w, = «+ E,, (36.1) 


the coefficients 6 being the same for all subsystems in a given closed system 
(see (6.4), and the corresponding relation (4.5) for the classical case). Hence 


Wy = ert FEn; 


using the purely formal notation B = —1/T, « = F/T, we have an expression: 
of the same form as the Gibbs distribution (31.1). It remains to show that the 
fundamental thermodynamic relations can be derived from the Gibbs distri- 
bution itself, i.e. in a purely statistical manner. 

We have already seen that the quantity 6, and therefore T, must be the 
same for all parts of a system in equilibrium. It is also evident that 8 < 0, i.e. 
T > 0, since otherwise the normalisation sum 2w,, must diverge: owing to- 
the presence of the kinetic energy of the particles, the energy E, can take 
arbitrarily large values. All these properties agree with the fundamental 
properties of the thermodynamic temperature. 

To derive a quantitative relation, we start from the normalisation condi- 
tion 

FET = 
» eF—E,/T = 1, 
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We differentiate this equation, regarding the left-hand side as a function of T 
and of various quantities 41, 42, ... which represent the external conditions 
to which the body considered is subject; these quantities may, for example, 
determine the shape and size of the volume occupied by the body. The energy 
levels E,, depend on A,, 4, ... aS parameters. 

Differentiation gives 


OE, F— 


Wn n E, La 
ee lar-3y da—-=—* ar'| sg: 


where for simplicity only one external parameter is used. Hence 


of, _ dT 
at T [f= »» wnEn)- 


On the left-hand side Zw, = 1, and on the right-hand side 


dF YY Wa = day Wa 


OEn  OEn 
2, WnBin = E, Wn aq a 
Using also the formulae F—E = —TS and* 
OF, /0A = 0#/0A, (36.2) 


we have finally 
dF = —S dT+0A/dA-da. 


This is the general form for the differential of the free energy. 

In the same way we can derive the Gibbs distribution for a variable number 
of particles. If the number of particles is regarded as a dynamical variable, it 
is clear that it will be an “integral of the motion”, and additive, for a closed 
system. We must therefore write 


log Wan = &O+BE,+yN, (36.3) 


where A, like 8, must be the same for all parts of a system in equilibrium. 
Putting « = Q/T, B = —1/T, y = p/T, we obtain a distribution of the form 
(35.2), and then by the same method as above we can deduce an expression 
for the differential of the potential Q. 


t If the Hamiltonian 7 (and therefore its eigenvalues E,) depends on a parameter 
A, then 


OE, [0A = (OBO) an; 


see Quantum Mechanics, (11.16). On statistical averaging this gives (36.2). 


CHAPTER IV 


IDEAL GASES 


§ 37. The Boltzmann distribution 


One of the most important subjects of study in statistical physics is an 
ideal gas. By this is meant a gas in which the interaction between the par- 
ticles (molecules) is so weak as to be negligible. Physically, this approxima- 
tion may be allowable either because the interaction of the particles is small 
whatever the distances between them or because the gas is sufficiently rare- 
fied. In the latter case, which is the more important, the rarefaction of the gas 
results in its molecules’ being almost always at considerable distances apart, 
such that the interaction forces are quite small. 

The absence of interaction between the molecules enables the quantum- 
mechanics problem of determining the energy levels E, of the gas as a whole 
to be reduced to that of determining the energy levels of a single molecule. 
These levels will be denoted by e,, the suffix k representing the set of quantum 
numbers which define the state of the molecule. The energies E, are then given 
by the sums of the energies of the various molecules. 

It must be remembered, however, that, even when there is no direct force 
interaction, quantum mechanics gives a peculiar mutual effect of particles 
that are in the same quantum state (called the exchange effect). For example, 
if the particles obey Fermi statistics, this effect has the result that no more 
than one particle can be in each quantum state at one time; a similar effect 
but in a different form occurs for particles which obey Bose statistics. 

Let n, be the number of particles in a gas which are in the kth quantum 
State; the numbers n, are called the occupation numbers of the various 
quantum states. Let us consider the problem of calculating the mean values 
n, of these numbers, and take in particular the extremely important case 
where for all k 

ny «1. (37.1) 


tT It should be emphasised that, when speaking of the quantum state of an indi- 
vidual particle, we shall always refer to states which are fully determined by a set 
of values of all the quantum numbers (including the orientation of the angular 
momentum of the particle, if any). These should not be confused with the quantum 
energy levels; several different quantum states correspond to a given energy level 
if the latter is degenerate. 
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Physically this case corresponds to a sufficiently rarefied gas. We shall later 
establish a criterion which ensures the fulfilment of this condition, but it may 
be mentioned immediately that it is in practice satisfied for all ordinary 
molecular or atomic gases. The condition would be violated only at such high 
densities that the matter concerned certainly could not be regarded as an 
ideal gas. 

The condition n, « 1 for the mean occupation numbers signifies that in 
fact not more than one particle is in each quantum state at any instant. 
Consequently, we may neglect not only the direct forces of interaction of the 
particles but also their indirect quantum interactions mentioned above. This 
in turn enables us to apply the Gibbs distribution formula to the individual 
molecules. For the Gibbs distribution has been derived for bodies which are 
relatively small, but at the same time macroscopic, parts of large closed sys- 
tems. The macroscopic nature of these bodies made it possible to regard them 
as quasi-closed, i.e. to neglect to some extent their interaction with other 
parts of the system. In the case under consideration the separate molecules of 
the gas are quasi-closed, although they are certainly not macroscopic bodies. 

Applying the Gibbs distribution formula to the gas molecules, we can say 
that the probability that a molecule is in the kth state is proportional to e~**7, 
and therefore so is the mean number n, of molecules in that state, i.e. 


Ny, = ae~wT, (37.2) 


where a is a constant given by the normalisation condition 


ym = N : (37.3) 


(N being the total number of particles in the gas). The distribution of mole- 
cules of an ideal gas among the various states that is given by formula (37.2) 
is called the Boltzmann distribution; it was discovered by Boltzmann for clas- 
sical statistics in 1877. 

The constant coefficient in (37.2) can be expressed in terms of the thermo- 
dynamic quantities for the gas. To do this we shall give another derivation of 
the formula, based on the application of the Gibbs distribution to the as- 
sembly of all particles in the gas that are in a given quantum state. We are 
able to do this (even if the numbers n, are not small) since there is no direct 
force of interaction between these particles and the remainder (or between any 
of the particles in an ideal gas), and the quantum exchange effects occur only 
for particles in the same state. Putting E = n,e,, N =n, and adding the 
suffix k to 2 in the general formula for the Gibbs distribution for a variable 
number of particles (35.2), we find the probability distribution for various 
values of nm, as : 

Wa, = let mu—eg VT , (37.4) 
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In particular, wo = e”*? is the probability that there are no particles in 
the state concerned. In the case of interest here, for which n, « 1, the 
probability wo is almost unity, and so in the expression w, = e@#+*—%)’ for 
the probability of finding one particle in the kth state we can put e?”? = 1 
to within terms of a higher order of smallness. Then w; = e“—®/", The 
probabilities of values n, > 1 must be taken as zero in the same approxima- 
tion. Hence 

Me = Walk = Wiel, 
ny 


and we have the Boltzmann distribution in the form 
Ny = eu—enyT , (37.5) 


Thus the coefficient in (37.2) is expressed in terms of the chemical potential 
of the gas. 


§ 38. The Boltzmann distribution in classical statistics 


If the motion of gas molecules (and of the atoms in them) were subject to 
classical mechanics, we could use, instead of the distribution over quantum 
states, the distribution of molecules in phase space, i.e. over momenta and 
coordinates. Let dN be the mean number of molecules “contained” in a vol- 
ume element of phase space of the molecule, dp dq = dp, ... dp, dq, ... dq, 
(r being the number of degrees of freedom of the molecule). We may write 
this as 

dN = n(p,g) dr, dr = dpdgq/(2xhy (38.1) 


and call n(p, q) the density in phase space (although dz differs by a factor 
(22h)~" from the volume element in phase space). We then have, instead of 


(37.5), 
n(p, g) = ele OT, (38.2) 


where «(p, qg) is the energy of the molecule as a function of the coordinates 
and momenta of its atoms. 

Usually, however, it is not the entire motion of the molecule which is 
quasi-classical, but only the motion corresponding to some of its degrees of 
freedom. In particular, in a gas which is not in an external field, the transla- 
tional motion of molecules is always quasi-classical. The kinetic energy of the 
translational motion then appears in the energy e, of the molecule as an inde- 
pendent term, while the remaining part of the energy does not involve the 
coordinates x, y, z and momenta p,, p,, Pp, of the centre of mass of the mole- 
cule. This enables us to separate from the general formula for the Boltzmann 
distribution a factor which gives the distribution of the gas molecules with 


oe 
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respect to these variables. The distribution of the molecules in the volume 
occupied by the gas is clearly just a uniform distribution, and we obtain for 
the number of molecules per unit volume with momenta (of the translational 
motion) in given intervals dp,, dp,, dp, the Maxwellian distribution : 


N 
ANy = Pete %P (—(P8+ 13+ PD/2mT] dp, dp, dpe, (38.3) 


N mir 


dN, = V Only? exp [—m(v} + 02+ v2)/2T] dv, dv, dv, (38.4) 


(m being the mass of a molecule), normalised to N/V particles per unit 
volume. 

Let us next consider a gas in an external field, in which the potential energy 
of a molecule depends only on the coordinates of its centre of mass: u = 
u(x, y, z) (for example, a gravitational field). If, as always occurs in prac- 
tice, the translational motion in this field is quasi-classical, then u(x, y, z) 
appears in the energy of the molecule as an independent term. The Max- 
wellian distribution for the velocities of the molecules remains unchanged, 
of course, while the distribution for the coordinates of the centre of mass is 
given by the formula 


dN, = noe" IT AV, (38.5) 


This formula gives the number of molecules in an element of volume dV = 
dx dy dz; the quantity 
n(r) = noe“ » 2T (38.6) 


is the number density of the particles. The constant mo is the density at points 
where u = 0. Formula (38.6) is called Boltzmann’s formula. 

In particular, in a uniform gravitational field along the z-axis, u = mgz, 
and the gas density distribution is given by the barometric formula 


 n(z) = noe—mszlT (38.7) 


where Mo is the density at the level z = 0. 

At large distances from the Earth, its gravitational field must be described 
by the exact Newtonian expression, the potential energy u vanishing at in- 
finity. According to formula (38.6) the gas density should remain finite and 
not zero at infinity, but a finite quantity of gas cannot be distributed in an 
infinite volume with a density which is nowhere zero. This means that in a 
gravitational field a gas (such as the atmosphere) cannot be in equilibrium 
and must be continuously dissipated into space. 


§ 39 Molecular Collisions 115 


PROBLEMS 


PROBLEM 1. Find the density of gas in a cylinder of radius R and length / rotat- 
ing about its axis with angular velocity Q, there being a total of N molecules in 
the cylinder. 


SOLUTION. It has been mentioned in § 34 that the rotation of a body as a whole 
is equivalent to the presence of an external field with potential energy —tmQy 
(where r is the distance from the axis of rotation). The gas density is therefore 


n(r) == Aematrt/27 
Normalisation gives 
Nm O2%ematr?/2T 


mM) = rena — 4)" 


PROBLEM 2, Find the momentum distribution of particles for a relativistic 
ideal gas. 


SOLUTION. The energy of a relativistic particle is given in terms of its momen- 
tum by € = c4/(m*c?+-p®), where c is the velocity of light. The normalised momen- 
tum distribution is 


N exp {—c+/(mPc? + p*)/T} dp, dp, dp, 


sh V 2T/mc*}?K, (mc?/T)+-(T]me)K (mc2/T)  — 4acmcP 


where K, and K, are Macdonald functions (Hankel functions of imaginary argu- 
ment). In the calculation of the normalisation integral the following formulae are 
used: 


J e77e08! sinh? ¢ de = K,(z)/z, 
0 


Ky(z) = —K,(z)/z—K,(z). 


§ 39. Molecular collisions 


The molecules of a gas enclosed in a vessel collide with its walls as they 
move. Let us calculate the mean number of impacts between the molecules 
of a gas and a unit area of the wall per unit time. 

We take an element of surface area of the vessel wall and define a co- 
ordinate system with the z-axis perpendicular to this element, which may 
then be written as dx dy. Of the molecules in the gas, those which reach the 
vessel wall in unit time, i.e. collide with it, are just those whose z coordinate 
does not exceed the component »v, of their velocity along that axis (which, 
of course, must also be directed towards the wall, not away from it). 

The number dy, of collisions of molecules per unit time (and per unit area 
of the wall surface), in which the velocity components are in given intervals 
dv,, dv,, dv, is therefore obtained by multiplying the distribution (38.4) by 
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the volume of a cylinder of unit base area and height 2,: 


3/2 
doe int exp [—m(v2 + v2 + 02)/2T]X v, dv, dv, dv,. (39.1) 


~V (aa 


From this we easily find the total number » of impacts of gas molecules on 
unit area of the vessel wall per unit time. To do so, we integrate (39.1) over 
all velocities v, from 0 to co and over v, and v, from — to © ; integration 
over v, from — o to 0 is not required, since when v, < 0 the molecule is 
travelling away from the wall, and so does not collide with it. Hence 


N/T P 
v=zE / Sami ~ Tam * (39.2) 


here we have expressed the density of the gas in terms of its pressure by 
means of Clapeyron’s equation. 

Formula (39.1) may be written in spherical polar coordinates in velocity 
space, using instead of v,, v,, v, the absolute magnitude of the velocity 
and the polar angles 8 and ¢ which define its direction. Taking the polar axis 
along the z-axis, we have v, = v cos 6 and 
ad ( th " e-moATy8 sin 6 cos 6 dO do dv. (39.3) 


dy, aT} 


~V 


Let us now consider collisions between gas molecules. To do this, we must 
first find the velocity distribution of the molecules (the term velocity every- 
where referring to the velocity of the centre of mass) relative to one another. 
We take any one gas molecule and consider the motion of all the other mole- 
cules relative to it, i.e. consider for each molecule not its absolute velocity v 
(relative to the walls of the vessel) but its velocity v’ relative to some other 
molecule. That is, instead of dealing with individual molecules, we always 
consider the relative motion of a pair of molecules, ignoring the motion of 
their common centre of mass. 

We know from mechanics that the energy of the relative motion of two 
particles of masses m, and mz is $yn’v®, where m! = mym2/(m1+ ma) is their 
“reduced mass” and v” their relative velocity. The relative-velocity distribu- 
tion of the molecules of an ideal gas therefore has the same form as the 
absolute-velocity distribution, except that m is replaced by the reduced mass 
m’. Since all the molecules are alike, m’ = $m, and the number of molecules 
per unit volume with a velocity relative to the selected molecule between v’ 
and v’-+ dv’ is 


ANy = (MY o—morrry2 dy’ (39,4) 
a7 
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A collision between molecules may be accompanied by various processes: 
deflection (scattering) through a certain angle, dissociation into atoms, and 
so on. The processes which occur in collisions are usually described by their 
cross-sections. The cross-section for a particular process which occurs in col- 
lisions between a given particle and others is the ratio of the probability of 
such a collision per unit time to the particle flux density (the latter being the 
number of such particles per unit volume multiplied by their velocity). The 
number of collisions (per unit time) between this and other particles which 
are accompanied by a certain process with cross-section o is therefore 

fe * 5 (Fr) { e~mATgr/3 day, (39.5) 
0 
The total number of such collisions per unit time throughout the volume of 
the gas is obviously »’N/2. 


PROBLEMS 


ProsBLEM 1, Find the number of impacts of gas molecules on unit area of the 
wall per unit time for which the angle between the direction of the velocity of the 
molecule and the normal to the surface lies between 9 and 6-+-d0. 


SOLUTION. 


1/2 
dv = ha (re) sin 0 cos 6 dé. 
V \ma 


PROBLEM 2, Find the number of impacts of gas molecules on unit area of the 
wall per unit time for which the absolute magnitude of the velocity lies between 
vand v-+dyv. 


SOLUTION. 


N 3/2 
dy, = rar) e~mor/2Ty3 dy, 


PROBLEM 3. Find the total kinetic energy E;,, of the gas molecules striking 
unit area of the wall per unit time. 
SOLUTION. 
E. ~V 27° 2T 
ine = mi mr 


PROBLEM 4, Find the number of collisions between one molecule and the rest 
per unit time, assuming the molecules to be rigid spheres of radius r. 


SOLUTION. The cross-section for collisions between molecules is then o = 
m(2r)? = 4ztr® (since a collision occurs whenever two molecules pass at a dis- 
tance less than 2r). Substitution in (39.5) gives 


p= l6r? |/— v= 16 1/ FP. 
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§ 40. Ideal gases not in equilibrium 


The Boltzmann distribution can also be derived, in a quite different man- 
ner, directly from the condition of maximum entropy of the gas as a whole, 
regarded as a closed system. This derivation is of considerable interest in 
itself, since it is based on a method whereby the entropy of gas in any non- 
equilibrium macroscopic state may be calculated. 

Any macroscopic state of an ideal gas may be described as follows. Let us 
distribute all the quantum states of an individual particle of the gas among 
groups each containing neighbouring states (which, in particular, have neigh- 
bouring energy values), both the number of states in each group and the 
number of particles in these states being still very large. Let the groups of 
states be numbered j = 1,2, ..., and let G, be the number of states in 
group j, and N, the number of particles in these states. Then the set of num- 
bers N, will completely describe the macroscopic state of the gas. 

The problem of calculating the entropy of the gas reduces to that of deter- 
mining the statistical weight AI" of a given macroscopic state, i.e. the number 
of microscopic ways in which this state can be realised. Regarding each 
group of N, particles as an independent system and denoting its statistical 
weight by Al’, we can write 

AF ={[Al;. (40.1) 
j 


Thus the problem reduces to that of calculating the AT’. 

In Boltzmann statistics the mean occupation numbers of all quantum states 
are small in comparison with unity. This means that the numbers of particles 
N, must be small compared with the numbers of states G, (N; « G;,), but of 
course themselves still large. As has been explained in § 37, the smallness of 
the mean occupation numbers enables us to suppose that all the particles are 
entirely independently distributed among the various states. Placing each of 
the N, particles in one of the G, states, we obtain altogether GM possible 
distributions, but among these the distributions which differ only by a per- 
mutation of particles are identical, since the particles themselves are identical. 
The number of permutations of N, particles is N,!, and so the statistical 
weight of the distribution of N, particles among G; states is 


AT, = GMIN;!. (40.2) 
The entropy of the gas is calculated as the logarithm of the statistical 


weight: 
S = log Ar = L' log AL). 


Substitution of (40.2) gives . 
S = }) (Nj; log G;—log Nj}). 
J 
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Since the numbers N, are large, we can use the approximate formula’ 


log N! = N log (N/e), (40.3) 
obtaining 
S = dN log (eG, /N;). (40.4) 


This formula gives the solution of the problem, determining the entropy of 
an ideal gas in any macroscopic state defined by the set of numbers N,. It may 
be rewritten by using the mean numbers 7, of particles in each of the quantum 
states in group j: n, = N,/G,. Then | 


S = 3 Gry log (e/nj). (40.5) 


If the motion of the particles is quasi-classical, then in this formula we can 
change to the particle distribution in phase space. Let the phase space of 
a particle be divided into regions Ap Aq, each of which is small but 
nevertheless contains a large number of particles. The numbers of quantum 
states belonging to these regions are 


Gy = Ap) Ag? /(2ahy = Ac, (40.6) 


where r is the number of degrees of freedom of the particle; the numbers of 
particles in these states may be written as N, = n(p, q) Ac, where n(p, q) is 
the particle density distribution in phase space. We substitute these expres- 
sions in (40.5), and use the fact that the regions 4t are small in size and 
large in number to replace the summation over j by integration over the 
whole phase space of the particle: 
S = Jn log (e/n) de. (40.7) 
In a state of equilibrium, the entropy must be a maximum (as applied to 
the ideal gas, this statement is sometimes called Boltzmann’s H theorem). We 
shall show how this condition may be used to find the distribution function 
for the gas particles in a state of statistical equilibrium. The problem is to 
find 7, such that the sum (40.5) has the maximum value possible under the 
subsidiary conditions 


2; = 2, Git =N, 
3 esNy =") eG jn; = E, 


t When N is large, the sum log N! = log 1+log 2+ ... +log N may be approx- 
imately replaced by the integral 


N 
j log x-dx, 
0 


which leads to (40.3). 
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which express the constancy of the total number of particles N and of the 
total energy E of the gas. Following the usual method of Lagrange’s undeter- 
mined multipliers, we have to equate to zero the derivatives 


O(S +aN +BE)/On; = 0, (40.8) 
where « and # are constants. Effecting the differentiation, we find 
G,(—log n;+%+e,) = 0, 


whence log n,; = «+e, or 
nj = exthy, 


This is just the Boltzmann distribution, the constants « and £ being given in 
terms of T and wu by « = p/T, B = —1/T+ 


§ 41. The free energy of an ideal Boltzmann gas 
We may use the general formula (31.3) 
F =—T log }\ e~*wT (41.1) 


to calculate the free energy of an ideal gas which obeys Boltzmann statistics. 

Writing the energy EZ, as a sum of energies e,, we can reduce the summation 
over all states of a gas to summation over all states of an individual molecule. 
Each state of the gas is defined by a set of N values of e, (where N is the num- 
ber of molecules in the gas), which in the Boltzmann case may be regarded 
as all different (there being not more than one molecule in each molecular 
state). Writing e~*~7 as a product of factors e~*’7 for each molecule and 
summing independently over all states of each molecule, we should obtain 


(x ear)" (41.2) 


The set of possible values of ¢, is the same for each molecule of the gas, and 
so the sums Xe~**” are also the same. 

The following point must be borne in mind, however. Each set of N differ- 
ent values «, which differs only in the distribution of the identical gas mole- 
cules over the levels 2, corresponds to the same quantum state of the gas. 
But in the partition function in formula (41.1) each state must be included 


t These values of « and 8 could have been foreseen: equations (40.8) can be 
written as a relation between differentials, dS-+-« dN+£ dE = 0, which must be the 
same as the differential of the internal energy at constant volume, dE = T dS+-4 dN. 
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only once.t We must therefore again divide the expression (41.2) by the 
number of possible permutations of N molecules, i.e. by N!.+ Thus 


Year = = (z cor)". (41.3) 


Substitution of this expression in (41.1) gives 
F = —TN log 2 e~T 4+.T log N!. 


Since N is very large, we can use the formula (40.3) for log N!. This gives the 
formula 


F=-NT log lempeo"], (41.4) 


which enables us to calculate the free energy of any gas consisting of identical 
particles obeying Boltzmann statistics. 
In classical statistics, formula (41.4) must be written in the form 


F = —NT log [(e/N) fe“ 7 dr}; (41.5) 


the integration is taken over the phase space of the molecule, and dz is defined 
by (38.1). 


§ 42. The equation of state of an ideal gas 


It has already been mentioned in § 38 that the translational motion of the 
molecules in a gas is always quasi-classical; the energy of a molecule may be 
written in the form 


Ex(Dxs Pys Dz) = (Pe + D3 +P2)/2m + ex, (42.1) 


where the first term is the kinetic energy of the translational motion, and «, 
denotes the energy levels corresponding to the rotation and internal state of 
the molecule; e, is independent of the velocities and coordinates of the centre 
of mass of the molecule (assuming that there is no external field). 

The partition function in the logarithm in formula (41.4) must now be 
replaced by the expression 


Oi | | exp [—ex(p)/T] d°p dV = V(mT /2nh#)" Yet; (42.2) 


t See the second footnote to § 31. 
t Here it is important that in Boltzmann statistics the terms containing the 
same €; in (41.2) are of negligible significance. 
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the integration over V (dV = dx dy.dz) is over the whole volume of the gas. 
For the free energy we obtain. ; m4 


3/2 
F =—NT log Fa (sa) 3 er, (42.3) 


The sum in (42.3) cannot, of course, be calculated in .a general .form 
without any assumptions as to the properties of the molecules, but an impor- 
tant fact is that it depends only on the temperature. The dependence of the 
free energy on the volume is therefore entirely determined by formula (42.3), 
and so we can derive from it various important general results concerning 
the properties of an ideal gas (which is not in an external field). 

Separating in (42.3) the term containing the volume, we may write this for- 
mula as 


_ F =—NT log (eV/N)+Nf(T), _. (42.4) 


where f(T’) is some function of the temperature. Hence the pressure of the gas 
is P = —OF/OV = NT/V, or 


PV = NT. (42.5) 


Thus we have the familiar equation of state of an ideal gas (Clapeyron’ s 
equation). If the temperature is measured in degrees, then’ 


PV = NET. (42.5a) 


Knowing F, we can find the other thermodynamic quantities also. For 
example, the thermodynamic potential is 


® = —NT log (eV/N)+Nf(T) + PV. 


Substituting V = NT/P according to (42.5) (since ® must be expressed as a 
function of P and 7’) and using a new function of temperature y(7) = f(T) 
—T log T, we obtain 


® = NT log P+ Nx(T). (42.6) 

The entropy is defined as 
S = —OF/0T = N log (eV/N)—Nf'(T), (42.7) 
or, as a function of P and T, | 


S = —00/8T = —N log P—Ny(T). (42.8) 


+ For a gram-molecule of gas (N = 6.023X 103 = Avegaies $s number), the 
product R == Nk is called the gas constant: R = 8.31410" erg/deg. 
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Finally, the energy is 
E = F4+TS = Nf(T)—NTf'(T). (42.9) 


We see that the energy is a function only of the temperature of the gas (and 
the same is true of the heat function W = E+PV = E+NT). This is evident 
a priori: since the molecules of an ideal gas are assumed not to interact, the 
change in their mean distance apart when the total volume of the gas varies 
cannot affect its energy. 

As well as E and W, the specific heats C, = (O£/0T), and C, = (OW/eT), 
are functions only of the temperature. In what follows it will be convenient to 
use the specific heats per molecule, denoted by lower-case letter c: 


om = Ne», Cp = Ney. (42.10) 
Since for an ideal gas W—E = NT, the difference C,—C, has a fixed value:' 
Cp—Cy = 1. | (42.11) 

PROBLEMS 


PRoBLEM 1, Find the work done on an ideal gas in an isothermal change of 
volume from V, to V2 (or of pressure from P, to P,). 


SOLUTION. The required work R is equal to the change in the free energy of the 
gas, and from (42.4) we have 


R= F,—F, = NT log (V,J¥%;)= = NT log (Po/P)). 


The quantity of heat absorbed in this process is 
Q = T(S.—S) = NT log (V,/V;). 


The latter result also follows directly from the fact that R+Q is the change of 
energy and is equal to zero for an isothermal process in an ideal gas. 


PROBLEM 2. Two vessels contain two identical ideal gases at the same tempera- 
ture T and with equal numbers of particles N but at different pressures P, and Pp. 
The vessels are then connected. Find the change in entropy. 


SoLuTiIon. Before the vessels are connected, the entropy of the two gases is 
equal to the sum of their entropies, Sy = —N log (P,P2)—2Ny'(T). After the con- 
nection, the temperature of the gases remains the same (as follows from the con- 
servation of energy for the two gases). The pressure is given by the relation 


42th 1 (Lyd) 


t Since the specific heat is the derivative of the energy (quantity of heat) with 
respect to temperature, C must be replaced by C/k in the formulae when ordinary 
units (degrees) are used. For example, fomanls (42.11) in ordinary units becomes 
Cyp—Cy = k. 
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The entropy is now 
- P,+Pe ; 
S = 2Nlog = BP, 2Ny'(T). 
The change in entropy is therefore 
(P, +P.) 
A = oo , 
S = Nlog 4P,P, 


PROBLEM 3. Find the energy of an ideal gas in a cylindrical vessel of radius R 
and length / rotating about its axis with angular velocity 2. 


SoLuTion. According to § 34, the rotation is equivalent to the presence of an 
external “centrifugal” field with potential energy u = —imQ* (7 being the dis- 
tance of a particle from the axis of rotation). 

When an external field is present, the integrand in (42.2) contains an extra factor 
e~4T, and so in the argument of the logarithm in (42.3) the volume V is replaced 


by the integral [ e~“T dV. Thus 


F = Fy—NT log ; | e~"T Ay, 


where Fy is the free energy of the gas in the absence of the external field. 
In the present case this formula for the free energy becomes (in a rotating coor- 
dinate system) 


1 


IR 
7 me ae ag aioe: 72/27" 
F’= Fy nT os ia | | 2nr dr dz 


or 
= Fy—NT log Ex 


(em2*R/27_. »| . 


The angular momentum of the gas is 
M =—0F’/8Q 
2NT NmR?2Q 
"QT [ae-mOtRAr * 
The energy in a system rotating with the body is 
ps ; NmQ?R* 
E’ = F’—T OF foT= oa —e-mmnniany TNT, 
and in a fixed system of coordinates (see (26.5)) 


NmQ2R2 


E = E’+MQ = Fat 3a matRAT)— ’ 


where Ey is the energy of the gas at rest. 
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§ 43. Ideal gases with constant specific heat 


We shall see later that in many important cases the specific heat of a gas 
is constant, independent of temperature, over a greater or smaller tempera- 
ture interval. For this reason we shall now calculate in a general form the 
thermodynamic quantities for such a gas. 

On differentiating the expression (42.9) for the energy, we find that the 
function f(T) is related to the specific heat c, by —-T/’"(T) = c,. Integration 
of this relation gives 


S(T) =—c,T log T~CT +0, 


where ¢ and é are constants. Substitution in (42.4) gives for the free energy 
the final expression 


F = Neo—NT log (eV/N)—Ne,T log T—N¢T. (43.1) 


The constant ¢ is called the chemical constant of the gas. For the energy we 


have 
E= Neéo+Ne,T, (43.2) 


a linear function of the temperature. 

The thermodynamic potential © of the gas is obtained by adding to (43.1) 
the quantity PV = NT, the volume of the gas being expressed in terms of the 
pressure and the temperature. The result is 


® = Neo+NT log P—Ne,T log T~NCT. (43.3) 
The heat function W = E+PV is 
W = Neo+NopT. (43.4) 


Finally, differentiating (43.1) and (43.3) with respect to temperature, we ob- 
tain the entropy in terms of T and V and of T and P respectively: 


S = N log (eV/N)+Nc, log T+(C+0,)N, (43.5) 
S =—N log P+Ne, log T+(C+c,)N. (43.6) 


From these expressions for the entropy we can, in particular, derive im- 
mediately a relation (called the Poisson adiabatic) between the volume, tem- 
perature and pressure of an ideal gas (of constant specific heat) undergoing 
adiabatic expansion or compression. Since the entropy remains constant in 
an adiabatic process, we have from (43.6) —N log P-+Nc, log T' = constant, 
whence 7/P = constant or, using (42.11), 


T’P'~y = constant, (43.7) 
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where denotes the constant ratio Be 
Y = Cp/Cy. (43.8) 


Using also the aguation of state PV = NT, we obtain relations between T and 
V,and PandV: 


TV’-! = constant, PV” = constant. (43.9) 


PROBLEMS 


PROBLEM 1. Two identical ideal gases at the same pressure P and containing the 
same number of particles N but at different temperatures 7, and Ty are in vessels 
with volumes V, and V,. The vessels are then connected. Find the change in en- 


tropy. 

SOLUTION. Before the vessels are connected, the entropy of the two gases, equal 
to the sum of their entropies, is by (43.6) Sp =—2N log P+Ne, log (7;7}).t 
After the connection, the temperatures of the gases become equal. The sum of the 
energies of the two gases remains constant. Using the expression (43.2) for the 
energy, we find T= 4(T,+T7,), where T is the final temperature. 

After the connection, the gas contains 2N particles and occupies a volume 
Vit, = MT, +7))/P. Its pressure is then 2N7/(V,+V.) = P, i.e. the same as 
before. The entropy is 


= —2N log P+2Nc, log ($7, + 27), 


and the change in entropy is 
AS = S—S, 
(7, +7 


= Ne, log aT,T> 


PROBLEM 2. Find the work done on an ideal gas in adiabatic compression. 


SOLUTION. In an adiabatic process the quantity of heat Q = 0, and so R = 
E,—E,, where E,—E, is the change in energy during the process. According to 
(43.2) R = Ne,(T,—7;), where 7, and 7; are the gas temperatures before and after 
the process. R can be expressed in terms of the initial and final volumes V, and V2 
by means of the relation (43.9): 

R= Ne,T[,/¥_)? 3 — 1] 
= Ne,To{l —(V2/V3)"-4}, 

PROBLEM 3. Find the quantity of heat, gained by a gas in an isochoric process, 
i.e. one which occurs at constant volume. 

SOLUTION, Since in this case the work R = 0, we have 


Q= E,—E, = Ne,fT,—T,). 


t We everywhere omit the constant terms in the entropy and energy which are 
unimportant in the solution of problems. 
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PROBLEM 4, Find the work done and quantity of heat gained in an isobaric 
process, i.e. one which occurs at constant pressure. 


SOLUTION. At constant pressure 
R =—P(V,—V), Q= W,-W,, 
whence 
: R= N(T,—To), Q = Ne,(T,—7;). 


PROBLEM 5. Find the work done on a gas and the quantity of heat which it gains 
in compression from volume V, to V, in accordance with the equation PV" = ea 
(a polytropic process). 


SOLUTION. The work is 
V; 
R=- | PdV = —(V¥}-"—pi-’, 
n—1 
Vv, 
Since the sum of the quantity of heat gained and the work done is equal to the 


total change in energy, we have Q = Ne,(T,— T)— —R, and since T= PV/N= 
(a/N)V1-", 


Q= a( cot a =) a _yi-n), 


PROBLEM 6. Find the work done on an ideal gas and the quantity of heat which 
it gains on going through a cyclic process (i.e. one in which it returns to its initial 
state at the end of the process), consisting of two isochoric and two isobaric pro- 
cesses: the gas goes from a state with pressure and volume P,, V, to states P,, 
Vy; Po, Vo; Po, V3 Pi, Vz again. 


SOLUTION. The change in energy in a cyclic process is zero, since the initial and 
final states are the same. The work done and the quantity of heat gained in such 
a process are therefore the same with opposite signs (R = —Q). In order to find 
R in the present case, we note that in isochoric processes the work done is zero, 
and for the two isobaric processes it is respectively —P,(V.—V,) and —P,(V,—V,). 
Thus R = (V¥,—V;) (P2—P,). 


PROBLEM 7. The same as Problem 6, but for a cyclic process consisting of two 
isochoric and two isothermal processes, the successive volumes and temperatures. 
of the gas being V3, Ti; Vi, To; Ve, To; Vo, Ti; Ms T;. 


SOLUTION. 
— (T, = T;) N log (V/V). 


PROBLEM 8. The same as Problem 6, but for a cyclic process consisting of two 
isothermal and two adiabatic processes, the successive entropies, temperatures. 
and pressures being S,, 7, Py; $1, To; So, Tos Po: Se, 713 S15 Ty, Py. 


SOLUTION. 
Q = (T,—-T)) (Se—S)) 
= (T)—T,) [N log (P;/P2)+-Ncp log (T>/7;)I.- 
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PROBLEM 9. The same as Problem 6, but for a cyclic process consisting of two 
isobaric and two isothermal processes, the successive states being P;, 71; P1, 72; 
Po, Tz; Pg, T,; Py, Ty. 


SOLUTION. The work done on the gas in the isobaric processes is (see Problem 4) 


N(1,—T2) and M(T,—7;), and that in the isothermal processes is N7} log (P,/P1) 
and N7; log (P,/P.). The sum of these is R = N(7,—T7,) log (P2/P;). 


PROBLEM 10. The same as Problem 6, but for a cyclic process consisting of two 
isobaric and two adiabatic processes, the successive states being P,, S,, 7; P,, 
So} Po, So, 72; Po» S13 P,, Sy, Ti. 


SOLUTION. The temperature in the second state is T,(P,/P,)°-””, and in the 
fourth state 7;(P, /P.)¢-”” ; these are obtained from 7, and 7, by means of (43.7). 
The quantity of heat gained by the gas in adiabatic processes is zero, and in the 
isobaric processes it is (see Problem 4) 

Ne,{To(Po/P,)o- WY —T,} and 
NeglT,(P;|P2)°-”/”—T9). 
Hence 
Q = Neg? [(P, /P2)4-Y" — 1]+-Ne,To[(P2/P))a-Y” — 1}. 


PROBLEM 11. The same as Problem 6, but for a cyclic process consisting of two 
isochoric and two adiabatic processes, the successive states being V,, S,, 7,; 
Vi, Se; Vo. Sos Ta; Vo, S13 Vi, $1, 74. 


SOLUTION. Using the result of Problem 2, we find 
R= Ne,To[1 —(V,/ Vi +Ne 7 [1 —(V,/ V,)’—}. 


PROBLEM 12. Determine the maximum work that can be obtained by connecting 
vessels containing two identical ideal gases at the same temperature Ty and with 
equa! numbers of particles N but having different volumes V, and Vp. 


SOLUTION. The maximum work is done if the process occurs reversibly (i.e. 
if the entropy remains constant), and is equal to the difference between the ener- 
gies before and after the process (§ 19). Before the connection of the vessels, the 
entropy of the two gases is equal to the sum of their entropies, i.e. by (43.5) 


So = Nlog (eV, V2/N*)+2Ne, log To . 
After the connection we have a gas consisting of 2N particles occupying a volume 
V+ V, at some temperature 7. Its entropy is 
S = 2N log [e(V, + Vq)/2N]-+2Ne, log T. 


Equating S) and S gives the temperature 7: 
4V,V, i 


T=Tp) fea +VF 


The energy of the two gases before and after the connection is Ey = 2Nc,T, and 
E = 2Ne,T respectively. The maximum work is therefore 


4V,V, \G@-DR 
R - E,-E= 2 —T)= | oe . 
max = Ey—E= 2Ne(To—T) 2NesTo| (a, : a) 
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PROBLEM 13. The same as Problem 12, but for gases with the same pressure Py 
and different temperatures 7, and 7, before the connection of the vessels. 


SOLUTION. We have similarly 


Roux = Ned T,+T,-2 VT) ||" 
‘max = co 92" / (TT. latae| 
PROBLEM 14. Find the minimum work that must be done on an ideal gas in 


order to compress it from pressure P, to P, at a constant temperature equal to 
that of the surrounding medium (TJ = 7,). 


SOLUTION. According to (20.2) the minimum work is Ryj, = (E,—E,)— 
— 7 (S.—S,)+Po(V_.—V,), where the suffixes 1 and 2 refer to the gas before and 
after compression. In the present case the energy FE is unchanged (since the temper- 
ature is constant), ic. E,—E, = 0. Using (43.6), we find the change of entropy 
corresponding to the change of pressure from P, to Py: Sy—S, = N log (P,/P2), 
and the change of volume is V,—V, = N7)(1/P,—1/P,). Hence 


1 1\7 
Ruin = NT oe BrP p- ? alk 


PROBLEM 15. Determine the maximum work which can be obtained from an 
ideal gas cooled from temperature T to the temperature of the medium 7, at 
constant volume. 


SOLUTION. From the general formula (20.3), 
Runax = Neo(T—-T)+Negl log (To/T). 


PROBLEM 16. The same as Problem 15, but for a gas cooled from temperature 
T to the temperature of the medium Ty and at the same time expanding from pres- 
sure P to the pressure of the medium Pp. 


SOLUTION. 
Rmax = Ne,(T—T))+NTp log (P/Po)+NepTq log (To/T)+-N(TPo/P—-T)). 


PROBLEM 17. Gas at temperature 7, flows from a large thermally isolated reser- 
voir into an empty thermally isolated vessel, the gas pressure in the reservoir 
remaining constant. Find the change in the gas temperature. 


SOLUTION. The energy E of the gas in the vessel consists of the energy Ey which 
it had in the reservoir and the work done on it to “expel” it from the reservoir. 
Since the state of the gas in the reservoir may be regarded as steady, we have the 
condition Wy = E(cf.§ 18). The gas temperature in the vessel is therefore T = yTp. 


§ 44. The law of equipartition 


Before going on to calculate in detail the thermodynamic quantities for 
gases, making allowance for the various quantum effects, it is useful to con- 
sider the same problem from the point of view of purely classical statistics. 


10° 
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We shall see later when and to what extent the results obtained are applicable 
to actual gases. 

A molecule is a configuration of atoms executing small oscillations about 
certain equilibrium positions which correspond to minimum potential energy 
of their interaction. The potential energy is then of the form. 

vib 
u=Eeot > GiKGides 
kel 
where €9 is the potential energy of interaction of the atoms when they are 
all in their equilibrium positions; the second term is a quadratic function of 
the coordinates which give the deviations of the atoms from their equilibrium 
positions. The number r,,, of coordinates in this function is the number of 
vibrational degrees of freedom of the molecule. 

This number can be determined from the number 7 of atoms in the mole- 
cule. A molecule containing n atoms has a total of 3n degrees of freedom. 
Three of these correspond to the translational motion of the molecule as a 
whole, and three to its rotation as a whole. If all the atoms are collinear (and 
in particular for a diatomic molecule) there are only two rotational degrees 
of freedom. Thus a non-linear molecule of n atoms has 3n—6 vibrational 
degrees of freedom, and a linear one has 3n—5. For n = 1 there are, of 
course, no vibrational degrees of freedom, since all three degrees of freedom 
of an atom correspond to translational motion. 

The total energy « of the molecule is the sum of the potential and kinetic 
energies. The latter is a quadratic function of all the momenta, and these are 
equal in number to the full 3n degrees of freedom of the molecule. The energy 
e is therefore of the form ¢ = &+f(p,q), where f;,(p, 4) is a quadratic 
function of the momenta and coordinates; the total number of variables in 
this function is / = 6n—6 (for a non-linear molecule) or 6n—5 (for a linear 
molecule); in a monatomic gas, / = 3, since the coordinates do not Peppes 
at all in the expression for the energy. 

Substituting this expression for the energy in (41.5) we have 


ere t0/T 
F =—NT log W efnlr. IT dr, 
In order to find the dependence on temperature of the integral in this formula 
we substitute p = p’ /T, q = q' VT for all the / variables on which the func- 
tion fy(p, 7) depends. Since this function is quadratic, f,(p, 9) = Thy(p’, 7), 
and T cancels in the exponent of the integrand. The transformation of the 
differentials of these variables in dr gives a factor T"", which can be taken 
outside the integral. The integration over the vibrational coordinates gq is 
taken over the range of values corresponding to vibrations in which the 
atoms remain within the molecule. However, since the integrand diminishes 
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rapidly with increasing q, the integration may be extended to the whole range 
from — co to oo, as well as over all the momenta. The above-mentioned 
change of variables then. leaves the limits of integration unaltered, and the 
whole integral is a constant independent of temperature. Using also the fact 
that the integration with respect to the coordinates of the centre of mass of 
the molecule gives the volume V occupied by the gas, we obtain for the free 
energy an expression of the form 


| F =—NT log(AVe-“"T'(N), 


where A is a constant. Expanding the logarithm, we have an expression of 
exactly the type (43.1) with a constant specific heat equat to 


| cy = 4. (44, 1 
The specific heat c, = ¢,+ 1 is accordingly 
C= (+2). (44.2) 


Thus we see that a purely classical ideal gas must have a constant specific 
heat. Formula (44,1) enables us to state the following rule. Each variable in 
the energy e(p, q) of the molecule gives an equal contribution of 4 to the 
specific heat c, of the gas (44 in ordinary units) or, what is the same thing, 
an equal contribution of 4T to its energy. This is called the law of equiparti- 
tion. ; 

Since for the translational and rotational degrees of freedom the energy 
(p, gq) contains only the corresponding momenta, we can say that each of 
these degrees of freedom gives a contribution of 4. to the specific heat. Each 
vibrational degree of freedom corresponds to two variables (coordinate and 
momentum) in the energy «(p, q), and its contribution to the specific heat 
is 1. 

For the model considered here it is easy to find a general formula for the 
energy distribution of the gas molecules. For convenience we shall measure 
the energy of a molecule from the value éo, i.e. omit this constant from the 
expression for «(p, q). Let us consider the volume in the phase space of the 
molecule whose points correspond to values of e(p, q) not exceeding a given 
value ¢, i.e. determine the integral r(e) = { dz taken over the region «(p, q) 
« ¢, According to the foregoing discussion, e(p, q) is a quadratic function of 
I variables. We replace those / quantities p, g on which the energy «(p, q) 
depends by new variables p’ = p/+/«, q’ = q/e. Then the condition «(p, q) 
< ¢ becomes «(p’, q’) « 1, and f dr becomes ” f dr’. The integral f de’ is 
clearly independent of «, and so r = constantXe", whence dr(e) = con- 
stant X ef" de and the energy probability distribution is 


dw, = Ae~"Tet!-} de. 
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Determining A from the normalisation condition, we find 


I 


=: ——____ e~#/Tgtl-1 de, (44.3) 
Tr (3) 


dw, 


PROBLEM 


Find the specific heat of an ideal gas in the extreme relativistic case, where the 
energy of a particle is related to its momentum by &¢ = ep, ¢ being the velocity 
of light. 


SoLuTion. According to (41.5) we have 


eV 
a 1 —— —cpll Aa 2d 
F NT log | 4 p~ ap 


or, after carrying out the integration, 
F=—NT log (AVT3/N), 


where A is a constant. The specific heat is therefore c, = 3, which is twice the value 
for a non-relativistic monatomic gas. 


§ 45. Monatomic ideal gases 


The complete calculation of the free energy (and therefore of the other 
thermodynamic quantities) for an ideal gas requires a calculation of the 
specific form of the partition function in the logarithm in (42.3), 


Z= pew 


Here «, are the energy levels of the atom or molecule (the kinetic energy of 
the translational motion of the particle being excluded). If the summation is 
taken only over all the different energy levels, it must be remembered that a 
Ievel may be degenerate, and in this case the corresponding term must appear 
in the sum over all states as many times as the degree of degeneracy. Let this 
be g,. The degree of degeneracy of the level is often called in this connection 
its statistical weight. Omitting for brevity the prime in ¢,, we can write the 
partition function concerned in the form 


Z = Vgenw". (45.1) 


The free energy of the gas is 


3/2 
F =—NT log Ei (sa) Z|. (45.2) 
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Turning now to the consideration of monatomic gases, we must first of all 
make the following important comment. As the gas temperature increases, so 
does the number of atoms in excited states, including the states of the 
continuous spectrum, which correspond to ionisation of the atom. When the 
temperature is not too high, the relative number of ionised atoms in the gas 
is negligible, but the gas is almost completely ionised at temperatures 7 of 
the order of the ionisation energy J,,,, and not only for T > J,,, (see § 104). 
Thus a non-ionised gas can reasonably be considered only at temperatures 
such that T <« J,,,.* 

The atomic terms (neglecting their fine structure) are so situated that the 
separation between the ground state and the first excited level is comparable 
with the ionisation energy. At temperatures T « Jj,,,, the gas will therefore 
be practically free not only of ionised atoms but also of excited atoms, and so 
all the atoms may be regarded as being in the ground state. 

Let us first consider the simplest case, that of atoms which in their ground 
state have neither orbital angular momentum nor spin (L = S = 0), such as 
the atoms of the inert gases. The ground state is not degenerate, and the 
partition function reduces to a single term, Z = e~?, For monatomic 
gases it is customary to put &9 = 0, i.e. to measure the energy from the ground 
state of the atom, so that Z = 1. Expanding the logarithm in (45.2) as a sum 
of logarithms, we obtain for the free energy an expression of the type (43.1), 
with constant specific heat 


cy = 3 (45.3) 
and chemical constant 


3 m 


(O. Sackur and H. Tetrode, 1912). 

This value of the specific heat is due entirely to the translational degrees of 
freedom of the atom (4 for each degree of freedom); it will be remembered 
that the translational motion of the gas particles is always quasi-classical. 
The “electronic degrees of freedom” under these conditions (no excited 
atoms in the gas) have, of course, no effect on the thermodynamic quanti- 
ties.t 


t For different atoms the temperature /,,,/k lies between 5X 104 degrees (alkali 
metal atoms) and 28 x 104 degrees (helium). 

+ The electronic part of the thermodynamic quantities, naturally, can never be 
treated classically. In this connection we may note the fact (which in essence has 
been tacitly assumed already) that in classical statistics the atoms must be regard- 
ed as particles without internal structure. The impossibility of applying to effects 
within the atom a statistics based on classical mechanics is further shown by the 
absurd result obtained on substituting the interaction energy between the electrons 
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These expressions.enable us to deduce a criterion for the valiguty of Boltz- 
mann statistics. In this statistics it is assumed that 


sar ala <i. 
(e (37.1)). It is dearly sufficient to require the flim of the condition 


eT ge 


For the chemical Sotential b= IN we have from (43. 3), with c, and ¢ 
given by (45. 3) and (45.4), 


ae P /2nfP\3 
ea Pos laa (a) | 


N [2nh®\ 3? 
= Tlog 7 (ar) | (45.5) 
Thus we obtain the condition | 
(N/V) (f2/mT V2 < 1. | (45.6) 


For a given temperature, this condition requires that the gas should be 
sufficiently rarefied. Substitution of numerical values shows that in practice, 
for any atomic (or molecular) gas, this condition can be violated only at 
densities where the interaction of the particles becomes important and the 
gas can in any case no longer be regarded as ideal. 

It is useful to note the following intuitive interpretation of the above con- 
dition. Since the majority of atoms have energies of the order of T, and 
therefore momenta of the order of »/(mT), we can say that all the atoms oc- 
cupy in phase space a volume of the order of V(mT)*”, corresponding to 
~ V(mT)*"/#> quantum states. In the Boltzmann case this number must be 
large compared with the number N of particles, and hence we have (45.6). 

Finally, we may make the following comment. The formulae derived in 
this section appear at first sight to contradict Nernst’s theorem, since neither 
the entropy nor the specific heat is zero at T = 0. However, it must be 
remembered that, under the conditions for which Nernst’s theorem is stated, 
all actual gases condense at sufficiently low temperatures. For Nernst’s 
theorem requires that the entropy of a body should tend to zero at T = 0 for 
a fixed value of its volume. But as T — 0 the saturated vapour pressure of all 


and the atomic nucleus in the classical distribution formulae. This energy is of the 
form —a/r, where r is the distance of the electron from the nucleus and a is a con- 
stant. The substitution would give a factor e@/T in the distribution, which becomes 
infinite for 7 = 0. This would mean that all the electrons would have to “fall” into 
the nucleus in thermal equilibrium. 
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substances becomes arbitrarily small, so that a fixed finite quantity of matter 
in a fixed finite volume cannot remain gaseous as T — 0. 

If we consider a model of a gas, possible in principle, which consists of 
mutually repulsive particles, then, although such a gas will never condense, at 
sufficiently low temperatures Boltzmann statistics ceases to be valid, and the 
application of Fermi or Bose statistics leads, as we shall see later, to expres- 
sions which are in agr*ement with Nernst’s theorem. 


§ 46. Monatomic gases. The effect of the electronic angular momentum 


If only one of the angular momenta L and S is non-zero in the ground 
state of the atom, then this state again has no fine structure. In practice the 
absence of fine structure of the ground state is always due to a zero orbital 
angular momentum; the spin S is sometimes not zero (for example, atoms in 
the vapour of alkali metals). 

A level with spin S is (2S +1)-fold degenerate. The only difference as com- 
pared with the case discussed in § 45 is that the partition function Z is now 
2S +1 instead of 1, and so the chemical constant (45.4) is increased by the 
quantity’ 


Cs = log(2S + 1). (46.1) 


If the ground term of an atom has a fine structure, it must be remembered 
that the intervals in this structure may generally be comparable with T, and 
so all the components of the fine structure of the ground state must be taken 
into account in the partition function. 

The fine-structure components differ in the value of the total angular 
momentum of the atom (with given orbital angular momentum L and spin S). 
Let these levels, measured from the lowest of them, be denoted by ¢,. Each 
level with a given J is (2J + 1)-fold degenerate with respect to orientations of 
the total angular momentum.* The partition function therefore becomes 


Fs y (27 + 1)e-*v’T ; (46.2) 


t We may write out for reference the formula for the chemical potential of a 
monatomic ideal gas with statistical weight (degree of degeneracy) of the ground 


state g: 
P  (2nhi?\3? N (2nt?\3? 
= Til ———— | —— = 1 exer e . 
earn lira) Tae (mr) | ot 


This applies also toa Boltzmann gas of elementary particles; for instance, in an 
electron gas g = 2. 

t We assume that Russell-Saunders coupling is valid in the atom; see Quanturrt 
Mechanics, § 72. 
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the summation is taken over all possible values of J for the given L and S. 
We obtain for the free energy 


3/2 
F = —NT log IF (se) YQ2r+ Dew |, (46.3) 

This expression becomes considerably simpler in two limiting cases. Let 
us assume that the temperature is so high that T is large in comparison with 
all the fine-structure intervals: J >> ¢,;. Then we can put e~¥7 = 1, 
and Z becomes simply the total number of fine-structure components 
(2S + 1)(2L+ 1). The expression for the free energy involves the constant 
specific heat c, = 2 as before, and the quantity 


Sst = log [(2S+1)QL+1)} (46.4) 


is added to the chemical constant (45.4). 

Similar expressions for the thermodynamic quantities (with a different C) 
are obtained in the opposite limiting case where T is small compared with the 
fine-structure intervals.' In this case all terms may be neglected in the sum 
(46.2) except the one with e, = 0 (the lowest component of the fine structure, 
i.e. the ground state of the atom). In consequence the quantity added to the 
chemical constant (45.4) is 


ox = log(2J +1), (46.5) 


where J is the total angular momentum of the atom in the ground state. 

Thus, when the ground term of the atom has fine structure, the specific 
heat of the gas at sufficiently low and sufficiently high temperatures has the 
same constant value, but in the intermediate range it depends on the tem- 
perature and passes through a maximum. It must be borne in mind, however, 
that for gases concerned in practice (heavy-metal vapours, atomic oxygen, 
etc.) only the range of high temperatures, where the specific heat becomes con- 
stant, is of importance. 

So far we have ignored the possibility that the atom has a non-zero nuclear 
spin 7. The existence of such a spin causes the hyperfine splitting of atomic 
levels. The intervals in this structure are, however, so small that they may be 
neglected in comparison with T at all temperatures where the gas remains a 
gas.t In calculating the partition function, the energy differences between the 


t As examples, the quantities ¢ ;/k for the components of the triplet ground state 
of the oxygen atom are 230° and 320°; for those of the quintet ground state of 
the iron atom they are between 600° and 1400°; and for the doublet ground state 
of the chlorine atom 1300°. 

t The temperatures corresponding to the hyperfine structure intervals of various 
atoms range from 0.1° to 1.5°. 
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hyperfine multiplet components may be entirely neglected, and the splitting 
need be taken into account only as increasing the degree of degeneracy of 
each level (and therefore the sum Z) by a factor 2/ +1. Accordingly, the free 
energy contains an additional “nuclear” term 


Faue = —NT log (2i-+ 1). (46.6) 


This term does not affect the specific heat of the gas (the corresponding 
energy E,,,, == 0) and simply changes the entropy by S,,, = N log (2i+ 1), 
and the chemical constant by ¢,,,, = log (2i+ 1). 

Because the interaction between the nuclear spin and the electron shells is 
extremely weak, the “nuclear” part of the thermodynamic quantities usually 
plays no part in the various thermal processes and does not appear in the 
equations. We shall therefore omit these terms, as is usually done; in other 
words, we shall measure the entropy not from zero but from the value S,,,, 
due to the nuclear spins. 


§ 47. Diatomic gases with molecules of unlike atoms. Rotation of molecules 


Turning now to the calculation of the thermodynamic quantities for a 
diatomic gas, we may point out first of all that, just as monatomic gases can 
reasonably be considered only for temperatures T' which are small compared 
with the ionisation energy, a diatomic gas can be regarded as such only if T 
is small compared with the dissociation energy of the molecule.‘ This in turn 
means that only the lowest electronic state of the molecule need be retained 
in the partition function. 

Let us begin with the most important case, where the gas molecule in the 
lowest electronic state has neither spin nor orbital angular momentum about 
the axis (S = 0, A = 0); such an electronic term has, of course, no fine struc- 
ture. We must also distinguish molecules composed of unlike atoms (includ- 
ing different isotopes of the same element) from those composed of like 
atoms, since the latter case has certain specific properties. In the present sec- 
tion we shall assume that the molecule consists of unlike atoms. 

The energy level of a diatomic molecule is, to a certain approximation, the 
sum of three independent parts: the electron energy (which includes also the 
energy of the Coulomb interaction of the nuclei in their equilibrium position 
and will be measured from the sum of the energies of the separated atoms), 
the rotational energy, and the vibrational energy of the nuclei within the 
molecule. For a singlet electronic state, these levels may be written (see 


* As examples, the temperatures J4;.,/k for some diatomic molecules are H, 
52,000°, Ny 113,000°, O, 59,000°, Cl, 29,000°, NO 61,000°, CO 98,000°. 
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Quantum Mechanics, § 82) 
Eyx = €9+ho(v+ 3) +#K(K + 1)/21, (47.1) 


where éo is the electron energy, fw the vibrational quantum, v the vibrational 
quantum number, K the rotational quantum number (angular momentum of 
the molecule), /= m'r? the moment of inertia of the molecule (m’ = 
mM Ma/(my+ me) is the reduced mass of the two atoms and ro the equilibrium 
value of the distance between the nuclei). 

When the expression (47.1) is substituted in the partition function, the 
latter is resolved into three independent factors: 


Zz = e-0TZ ot Zvibs (47.2) 


where the “rotational” and “vibrational” sums are defined by 


Zrot = > (2K 4 1)e-PRU+ 271, (47.3) 
=O 

Ziv =F. e~MMoHHIT, (47.4) 
vo=0 


the factor 2K +1 in Z,,, taking account of the degeneracy of the rotational 
levels with respect to the orientations of the angular momentum K. Accord- 
ingly, the free energy is the sum of three parts: 

N \2nh? 


3/2 
F =—NT log | ) + Frot + Frin + Neo, (47.5) 


ev ( mT 
where m = m+ mz is the mass of the molecule. The first term may be called 


the ¢ranslational part F,, (since it arises from the degrees of freedom of the 
translational motion of the molecules), and 


F si = —NT log Lov Fviv = —NT log Zyib (47.6) 


the rotational and vibrational parts. The translational part is always given 
by a formula of the type (43.1) with a constant specific heat c, = % and 
chemical constant 
3 m 
Cr = > log rh? . (47.7) 


The total specific heat of the gas is the sum of several terms: 


Co = Ctr + Crot +Cvin > 


47.8 
Cp = Cer + Crot + Crip + 1, ( t ) 
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which arise respectively from the thermal excitation of the translational 
motion of the molecule, its rotation and the vibrations of atoms within 
the molecule. 

Let us next calculate the rotational free energy. If the temperature is so 
high that T > #2/2] (i.e. the “rotational quantum” 4?/2/ is small compared 
with T),' then the terms with large K are the most important in the sum (47.3). 
For large values of K, the rotation of the molecule is quasi-classical. In this 
case, therefore, the partition function Z,,, can be replaced by the corre- 
sponding classical integral: 


Zrot = f eM derot (47.9) 


where «(M) is the classical expression for the kinetic energy of rotation as 
a function of the angular momentum M. Using a system of coordinates &, 
n, ¢ rotating with the molecule, with the ¢-axis along the axis of the molecule, 
and bearing in mind that a diatomic molecule has two rotational degrees of 
freedom and the rotational angular momentum of a linear mechanical system 
is perpendicular to its axis, we can write 


e(M) = (M?+M3)/21. 


The element dz,,, is the product of the differentials dM,, dM, and the differ- 
entials dd,, dd, of the “generalised coordinates” corresponding to M,, M, 
(i.e. the infinitesimal angles of rotation about the & and 7 axes), divided by 
(2xh)?.t The product of two infinitesimal angles of rotation about the § and 
n axes is just the element of solid angle do, for the direction of the third axis 
¢, and integration over the solid angle gives 4. Thus! 


4x aa 1 
Zrot = (2h? | | exp | — arp e+ | dM, dM, = 21T/h’. 


Hence the free energy is 


Fi. = -NT log T—NT log (21/f?). (47.10) 


t In practice this condition is always satisfied for all gases except the two iso- 
topes of hydrogen. As examples, the values of #?/2k/ are: Hy 85.4°, D, 43°, HD 64°, 
Ng 2.9°, Op 2.1°, Cl, 0.36°, NO 2.4°, HCI 15.2°. 

¥ It must be remembered that this notation is to some extent arbitrary, since 
d@, and d@, are not total differentials of any function of the position of the axes. 

§ This value of Z,,, can also be derived in another way: assuming that the num- 
bers K in the sum (47.3) are large and replacing the summation by integration with 
respect to K, we have 


Zrot & f 2Ke—KMAIT AK = 2TI/#?. 
0 
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Thus, at the relatively high temperatures under consideration, the rotation- 
al part of the specific heat is a constant, c,,, = 1, in accordance with the 
general results of the classical treatment in § 44 (4 for each rotational degree 
of freedom). The rotational part of the chemical constant is ¢,., = log (2I/#). 
We shall see below that there is a considerable range of temperatures over 
which the condition T >> #?/27 holds and at the same time the vibrational 
part of the free energy, and therefore the vibrational part of the specific heat, 
are zero. Over this range the specific heat of a diatomic gas c, = Cy +, 4) 1-€. 


C= a G= (47.11) 
and the chemical constant € = €.+€.44: 
€ = log [(27/f5) (m/2x)®?}. (47.12) 


In the opposite limiting case of low temperatures, T <« h?/2/, it is sufficient 
to retain the first two terms of the sum: 


Zrot = 1+3e-/7, 


and for the free energy we have in the same approximation 


Frot = —3NTe—*#!'7 (47.13) 
Hence the entropy is 
3NR2 
Sie = IT e~PIT (1 4- IT /h?) (47.14) 
and the specific heat is 
Crot = SN (HP? /IT)? e~ #7, (47.15) 


Thus the rotational entropy and specific heat of the gas tend to zero essen- 
tially exponentially as T -- 0. At low temperatures, therefore, a diatomic gas 
behaves like a monatomic one; both the specific heat and the chemical con- 
stant have the same values as in a monatomic gas of particles of mass m. 

In the general case of arbitrary temperatures the sum Z,,, must be cal- 
culated numerically. Fig. 4 shows c,,, as a function of 277/h?. The rotational 
specific heat has a maximum of 1.1 at T = 0.81(#?/2/), and then tends asymp- 
totically to the classical value 1.7 


t An asymptotic expansion of the thermodynamic quantities for large values of 
2T1/h® may be obtained. The first two terms of the expansion for the specific 
heat are 


1/2 
Crot = sari) . 


It must be remembered, however, that this expansion gives only a poor approx- 
imation to the function c,o(T). 
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§ 48. Diatomic gases with molecules of like atoms. Rotation of molecules 


Diatomic molecules consisting of like atoms have certain specific proper- 
ties which necessitate changes in some of the formulae derived in § 47. 

First of all, let us consider the limiting case of high temperatures, where 
a classical treatment is possible. Since the two nuclei are identical, two oppo- 
site positions of the axis of the molecule (differing only in that the two nuclei 
are interchanged) now correspond to the same physical state of the molecule. 
The classical partition function (47.9) must therefore be halved, and so the 
chemical constant becomes 


Crot = log (I/h?); (48.1) 


accordingly the factor 2 disappears from the argument of the logarithm in the 
sum €,,+¢,o¢ (47.12). 

More important changes are needed at temperatures where the quantum 
treatment has to be used. Since in practice the entire problem is of interest 
only in its application to the two isotopes of hydrogen (Hz and Dz), we shalt 
consider these gases in what follows. The requirement of quantum-mechani- 
cal symmetry in the nuclei (see Quantum Mechanics, § 86) results in the 
electronic state *Z+ (the ground term of the hydrogen molecule) having 
rotational levels of different nuclear spin degeneracy for even and odd values 
of K: levels with even and odd K respectively occur only for even and odd 
total spin of the two nuclei, and have relative degrees of degeneracy &, = 
if(2i +1), g, = @+1)/(2i +5) for a half-integral spin 7 of the nuclei, and g, = 
(i+1)/(27 +1), g, = i/(2i+1) for integral i. For hydrogen there is an accepted 
terminology whereby the molecules in states of greater nuclear statistical 
weight are called orthohydrogen molecules, and those in states of smaller 
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statistical weight are called parahydrogen molecules. Thus for the He and D2 
molecules the statistical weights are 


ortho g,= 
para = & 


3 
i = DeG= D| 


49 


ortho g,= 
para gy = 


2 
3° 

i 
3 


The suffix g denotes that the molecule has an even total nuclear spin (0 for 
He, 0 or 2 for De) and even rotational angular momenta K; the suffix u 
signifies odd total nuclear spins (1 for He and De) and odd values of K. 

Whereas, in molecules with unlike nuclei, the nuclear degrees of degenera- 
cy of all the rotational levels are the same, and so the allowance for this degen- 
eracy simply gives an unimportant change in the chemical constant, here it 
causes a change in the form of the partition function, which must now be 
written’ 


Zrot = 8eLZg +2uZus (48.2) 
where 
2; = 3 (2K + l)e~#KK+)/2I7 | 
K=0,%, ... (48.3) 
Les » (2K + l)em#KK+1 227 , : 
K=1,3,... 
Similarly the free energy becomes 
Frot = —NT log (2,25 + SuZu)s (48.4) 


and the remaining thermodynamic quantities are likewise changed. At high 


temperatures, 
Ze % Zu = tZror = TI/h?, 


so that the previous classical expression is obtained for the free energy, as it 
should be. 

As T + 0 the sum Z, tends to unity and Z, tends exponentially to zero; 
at low temperatures, therefore, the gas behaves as if monatomic (the specific 
heat c,,, = 0) and the chemical constant simply contains a nuclear part 
Cauc = log &;- 

The above formulae relate, of course, to a gas in complete thermal equi- 
librium. In such a gas the ratio of the numbers of molecules of parahydrogen 
and orthohydrogen is a definite function of temperature, which from the 
Boltzmann distribution is 


xH, = Northo—H,/Npara—H, = Sulul&gZ@e = 3Z,,/Z,, 


(48.5) 
1/xp, ad Northo—D,/Npara—D; aa 8eZg/Bulu = 2Z4/Zy. 
t The normalisation of the nuclear statistical weights which we use (such that 
&g+&, = 1) signifies that the entropy is measured from log (27+ 1)2, in accordance 
with the condition stated at the end of § 46. 
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As the temperature varies from 0 to ©, the ratio xy,, varies from 0 to 3, 
and Xp, from 0 to + (at 7’ = 0 all the molecules are, of course, in the state 
with the lowest value of K, namely K = 0, corresponding to pure para-He 
and ortho-Dz). 

It must be borne in mind, however, that the probability of a change in the 
total nuclear spin in a collision between molecules is very small. The mole- 
cules of orthohydrogen and parahydrogen consequently behave practically as 
different modifications of hydrogen and are not‘ converted into each other. 
In practice, therefore, we are concerned not with a gas in equilibrium but with 
a non-equilibrium mixture of the ortho and para modifications, the relative 
amounts of which have given constant values.t The free energy of such a 
mixture is equal to the sum of the free energies of the two components. 

In particular, for x = e (pure ortho-He or para-De) we have 


Fyot = —NT log (guZu)- 


At low temperatures (47/277 >> 1) only the first term in the sum need be 
retained in Z,, so that Z, = 3e~”*"", and the free energy is 


Fro = Nh? /I—NT log (3g,). 


This means that the gas will behave as if monatomic (c,,, = 0), the chemical 
constant including an additional term log (3g,), and the energy a constant 
term N#?/T, corresponding to the rotational energy of all the molecules, with 
Kh, 


§ 49. Diatomic gases. Vibrations of atoms 


The vibrational part of the thermodynamic quantities for a gas becomes 
important at considerably higher temperatures than the rotational part, 
because the intervals in the vibrational structure of the terms are large com- 
pared with those in the rotational structure.’ 

We shall suppose, however, that the temperature is not large enough to 
excite the very high vibrational levels. Then the vibrations are small, and 
therefore harmonic, and the energy levels are given by the usual expression 
hoo{v +4) as in (47.4). 

The calculation of the vibrational partition function Z,,, (47.4) is ele- 
mentary. Owing to the very rapid convergence of the series, the summation 


t In the absence of suitable catalysts. 

t For an ordinary gas which has been at room temperature for a considerable 
time the ratios are xy, = 3, xp, = +. 

§ As examples, the values of /two/k for some diatomic gases are H, 6100°, Ng 
3340°, Og 2230°, NO 2690°, HCI 4140°. 


il 
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may be formally extended to v = oo. We shall measure the energy of the 
molecule from the lowest vibrational level (v = 0), i.e. include 24m in the 
constant 9 in (47.1). Then 


Zuty = Ye hoo'T = 1/(1—e-M't), 
vo=0 


and hence the free energy is 


Fyip = NT log (1—e-*"7), (49.1) 
the entropy 
Sviv = —N log (1—e*”’7) + Nitw/T(e*/7 — 1), (49.2) 
the energy 
Eviy = Nho/(e'7 — 1), (49.3) 
and the specific heat 
Cvib = (F) eoray : (49.4) 


Figure 5 shows c,;, as a function of T/hw. 


At low temperatures (fio >> T) all these quantities tend exponentially to 


ZETO: 


(49.5) 
Cvib = (fiw/T} eh!" 
At high temperatures (fw < T) we have 
Fyiy = —NT log T +NT log (ho) —N- iho, (49.6) 


corresponding to a constant specific heat c,,, = 1‘ and a chemical constant 


+ Again in accordance with the classical results of § 44. 
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cvip = —log (Aw). Adding these to the values (47.11), (47.12), we find that 
at temperatures T >> Aw the total specific heat of a diatomic gas is’ 


Cy = 4, Cp = 4, (49.7) 


and the chemical constant is 
20 ( m \3 
t = log Ea eS, iF (49.8) 


the factor (2) must be omitted for molecules consisting of like atoms. The 
first two terms in the expansion of E,,, are 


Evin = NT—4Nhoo. (49.9) 


The constant term —+Nhw appears here because the energy is measured from 
the lowest quantum level (i.e. from the energy of the “zero-point” vibra- 
tions), whereas the classical energy would have to be measured from the 
minimum of the potential energy. 

The expression (49.6) for the free energy can also be derived classically, of 
course, since for J >> fw the important quantum numbers v are the large 
ones, where the motion is quasi-classical. The classical energy of small oscil- 
lations of frequency « is 


1 
Evio( DP, J) = +. eae zm ‘og? 


where m’ is the reduced mass. The integration with this expression for e gives 
for the partition function 


Zuo = = | | e~*wT do dg = T/ha, (49.10) 


which corresponds to (49.6);+ owing to the rapid convergence of the integral 
the integration with respect to gq may be taken from — © to oo. 

At sufficiently high temperatures, when vibrations with large v are excited, 
the anharmonicity of the vibrations and their interaction with the rotation of 
the molecule may become important. These effects are in principle of the 
same order of magnitude. Since v is large, the corresponding correction to 
the thermodynamic quantities may be determined classically. 


* As Fig. 5 shows, cy; actually approaches its limiting value of 1 when T = fw; 
for T/hw = 1, cy, = 0.93. As a practical condition for the applicability of the 
classical expressions we may write T >> fiw/3. 

t The same result is obtained on replacing the summation over v by an integra- 
tion. 


11* 
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Let us consider a molecule as a mechanical system of two particles inter- 
acting in accordance with the law U(r) in a coordinate system in which their 
centre of mass is at rest. The energy (Hamiltonian) which gives a precise 
classical description of the rotation and vibrations of the system is the sum 
of the kinetic energy (the energy of a particle with the reduced mass m’) and 
the potential energy U(r). The partition function, after integration over the 
momenta, reduces to an integral over the coordinates: f e~ YT AV, and 
after integration over the angles (in spherical polar coordinates) there 
remains the integral 


f eYety dr. 
0 


The approximation corresponding to independent harmonic vibrations and 
rotation of the molecule is obtained by putting U(r) = Us+4m'w(r—ro)? 
and, in the integration, replacing the slowly varying factor r* by 72, where ry 
is the equilibrium distance between the particles: Up = U(ro). In order to 
take into account the anharmonicity of the vibrations and their interaction 
with the rotation we now write 


U(r) = Up + dm’ wri(E?— ak? + Be), (49.11) 


where & = r/ro—1, and « and £ are constants,’ and then expand the whole 
integrand in powers of &, separating the factor exp {—(Up +4m'w*r5é")/T}. 
In the expansion, only those terms need be retained which after integration 
give the highest and next highest powers of the temperature; the integration 
over £ is taken from — © to co. The zero-order term in the expansion gives 
the usual value of the partition function, and the remaining terms give the 
required correction. Omitting the calculations, we shall state the final result 
for the correction to the free energy: 


et 43-3 8 +->a8l, (49.12) 
8 m | 


Thus the anharmonicity of the vibrations and their interaction with the rota- 
tion give a correction to the free energy which is proportional to the square of 
the temperature. Accordingly the specific heat has a further term proportion- 
al to the first power of the temperature. 


§ 50. Diatomic gases. The effect of the electronic angular momentum 


Some types of molecule, though not many, have a non-zero orbital angular 
momentum or spin in their electronic ground state. 


+ These constants can be expressed in terms of the spectroscopic constants of 
the molecule; see Quantum Mechanics, § 82. 
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The presence of a non-zero orbital angular momentum A causes a twofold 
degeneracy of the electronic term, corresponding to the two possible direc- 
tions of this angular momentum with respect to the axis of the molecule.t 
This affects the thermodynamic quantities: because of the doubling of the 
partition function, a quantity 


Ca = log2 (50.1) 


is added to the chemical constant. 

The presence of a non-zero spin S causes a splitting into 2S +1 levels, but 
the intervals in this fine structure are so small (when A = 0) that they can 
always be neglected in calculating the thermodynamic quantities. The pres- 
ence of the spin simply increases the degree of degeneracy of each level by a 
factor 2S +1, and so the chemical constant is increased by 


ts = log (2S+1). (50.2) 


The fine structure which occurs when S # 0, A ¥ 0 requires special con- 
sideration. Here the fine-structure intervals may reach values which have to be 
taken into account in calculating the thermodynamic quantities. We shall 
derive the formulae for the case of a doublet electron term.t Each component 
of the electron doublet has its vibrational and rotational structure, the param- 
eters of which may be regarded as the same for each component. The 
partition function (47.2) therefore contains a further factor 


Zeal = So+gie~4/7, 


where go, g1 are the degrees of degeneracy of the components of the doublet, 
and 4 their separation. The free energy must accordingly contain an “elec- 
tronic part” 


Fa = —NT log (g0 +21e—4'7). (50.3) 


t Strictly speaking, the term is split into two levels (A-doubling), but the sepa- 
ration between these is so small that it may be entirely neglected here. 


t This case occurs for NO; the electronic ground state of the NO molecule is 
the doublet 71/2, 3/. with width 4 = 178°. Each component of the doublet is 
doubly degenerate. 

An unusual case occurs for oxygen. The electronic ground state of the O, molecule 
is a very narrow triplet 82’, the width of which may be neglected, but it happens 
by chance that the next (excited) state 14 (doubly degenerate) is relatively near, 
at 4 = 11,300°, and at high temperatures it may be excited, with a consequent 
effect on the thermodynamic quantities. 
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We may also give the “electronic” specific heat which must be added to the 
other parts of the specific heat: 
—_ (4(ry? 
[LF (go/ene47 1 + (ei/go)e-47 I 


In the limits T — 0 and T + -, c,, is of course zero, and it has a maximum 
at some temperature T ~ A. 


(50.4) 


PROBLEM 


Determine the correction to the free energy for oxygen due to the first excited 
electronic state of the O, molecule (see the last footnote). The temperature is large 
compared with the vibrational quantum, but small compared with the distance A 
between the ground state 32’ and the excited state 1. 


SOLUTION. The partition function is 


T TI T TI 
Besa Sp aarae 
joo Bt i’ B 


where the two terms on the right are the partition functions for the ground and 
excited states, each of which is the product of electronic, vibrational and rotational 
factors. The required correction to the free energy is therefore. 


2wr'? 
Fi, = —NT log (1 9 ¢~4T\ ~ —NT- ——® e-AT, 


where , ’9,’, rg are the frequencies and equilibrium distances between the nuclei 
in the ground and excited electronic states. 


§ 51. Polyatomic gases 


The free energy of a polyatomic gas, like that of a diatomic gas, can be 
written as the sum of translational, rotational and vibrational parts. The 
translational part, as before, is characterised by values of the specific heat 
and chemical constant 


Cr = 3, be = Slog (m/2nf?). (51.1) 


Owing to the large moments of inertia of polyatomic molecules (and the 
corresponding smallness of their rotational quanta) their rotation may always 
be treated classically.’ The polyatomic molecule has three rotational degrees 


+ Rotation quantisation effects would be observable only in methane CHy,, where 
they should occur at temperatures of about 50°K; see the Problem at the end of this 
section. 
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of freedom and three principal moments of inertia 1), 72, Js, which are in 
general different; its kinetic energy of rotation is therefore 


Mj, M3 , MP 


Erot or, t Wh, Wo +37,” (51.2) 


where &, 7, ¢ are coordinates in a rotating system whose axes coincide with 
the principal axes of inertia of the molecule; for the present we disregard 
the special case of molecules consisting of collinear atoms. This expression is 
to be substituted in the partition function 


Zror = [ em¥T Atror, (51.3) 
where 


dtrot = dM, dM, dM; dd; dd, dé;, 


ana 
and the prime denotes, as usual, that the integration is to be taken only over 
the physically different orientations of the molecule. 

If the molecule has axes of symmetry, rotations about these axes leave the 
molecule unchanged, and amount to an interchange of identical atoms. It is 
clear that the number of physically indistinguishable orientations of the mole- 
cule is equal to the number of possible different rotations about the axes of 
symmetry, including a rotation through 360° (the identical transformation). 
Denoting this number’ by a, we can take the integration in (51.3) simply over 
all orientations and divide by o. 

In the product dé, d@, dd, of three infinitesimal angles of rotation, 
dd, dd, may be regarded as an element do, of solid angle for directions of the 
¢-axis. The integration over o, is independent of that over rotations d@, 
about the ¢-axis, and gives 42. The integration over ¢, gives a further 27. 
Integrating also over oe M,, M, from — co to oo, we finally have 


Zrot = (nT )* (LsTals)' = (27) (sel als)! oh. 


oa 
Hence the free energy is 


1/2 
F = — 3 NT log T—NT log Corley ‘ 


5 (51.4) 


Thus we have for the rotational specific heat, in accordance with § 44, 


Ces (51.5) 


t For instance, in H,O (an isosceles triangle) 0 = 2, in NHg (an equilateral tri- 
angular pyramid) o = 3, in CH, (a tetrahedron) o = 12, and in C,H, (a regular 
hexagon) o = 12. 
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and the chemical constant is 
SacJyIels3)¥2 . 
b= op ele ve) (51.6) 


For a linear molecule, i.e. one where all the atoms are collinear, there are, 
as in the diatomic molecule, only two rotational degrees of freedom and one 
moment of inertia 7. The rotational specific heat and the chemical constant 
are, as in a diatomic gas, 


Cro = 1, Cras = log (21/oh?), (51.7) 


where o = 1 for an asymmetric molecule (such as NNO) and o = 2 for a 
molecule symmetrical about its midpoint (such as OCO). 

The vibrational part of the free energy of a polyatomic gas is calculated 
in a similar way to that for a diatomic gas, given above. The only difference 
is that a polyatomic molecule has not one but several vibrational degrees of 
freedom: a non-linear molecule of 7 atoms clearly has r,,, = 3n—6 vibra- 
tional degrees of freedom, while for a linear molecule of atoms r,,, = 
3n—5 (see § 44). The number of vibrational degrees of freedom determines 
the number of normal modes of vibration of the molecule, to each of which 
there corresponds a frequency w, (the suffix « numbering the normal modes). 
-It must be remembered that some of the frequencies w, may be equal, in 
which case the frequency concerned is said to be degenerate. 

In the harmonic approximation, where the vibrations are assumed small 
(only temperatures for which this is so will be considered), all the normal 
modes are independent, and the vibrational energy is the sum of the energies 
of the individual modes. The vibrational partition function therefore falls 
into a product of partition functions of the individual modes, and the free 
energy F,,, is a sum of expressions of the type (49.1): 


Fup = NT Y. log (1—e~#oeT), (51.8) 


Each frequency appears in this sum a number of times equal to its degeneracy. 
Similar sums are obtained for the vibrational parts of the other thermody- 
namic quantities. 
Each of the normal modes gives, in its own classical limit (T > ho,), 
a contribution c% = 1 to the specific heat; for 7’ greater than the greatest 
hoo, we should obtain . 
Cvib = Tvib- ~ (51.9) 


In practice, however, this limit is not reached, since polyatomic molecules 
usually decompose at considerably lower temperatures. 

The various frequencies w, for a polyatomic molecule generally range over 
a very wide interval. As the temperature increases, the various normal modes 
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successively contribute to the specific heat. In consequence the specific heat 
of polyatomic gases may often be regarded as approximately constant over 
fairly wide. intervals of temperature. 

We may mention the possibility of a curious change from vibration to 
rotation,.an instance of which is afforded by the ethane molecule C2H¢. This 
molecule consists of two CHs groups at a certain distance apart and oriented 
in a certain way to each other. One of the normal vibrations of the molecule is 
a “torsional” vibration, in which one of the CHs groups is twisted relative to 
the other. As the energy of the vibrations increases, their amplitude increases 
and ultimately, at sufficiently high temperatures, the vibration becomes a free 
rotation. The contribution of this degree of freedom to the specific heat, 
which is approximately 1 when the vibrations are fully excited, therefore 
begins to decrease as the temperature increases further, approaching asymp- 
totically, the value 4 typical of a rotation. 

Finally, it may be mentioned that, if the molecule has a non-zero spin S 
(for example, the molecules NOg and ClO), the chemical constant includes 


a term 
= log (25 +1). (51.10) 


PROBLEM 
Determine the rotational partition function for methane at low temperatures. 


SOLUTION. As already mentioned in the first footnote to this section, a quantum 
calculation of Z,,., for methane is required at sufficiently low temperatures. 

The CH, molecule is a tetrahedron of the spherical-top type, and so its rotational 
levels are #*J(J+1)/2/, where J is the common value of the three principal mo- 
ments of inertia, and J the rotational quantum number. Since the spin i of the 
H nucleus is 4 and that of the C!? nucleus is zero, the total nuclear spin of the 
CH, molecule may be 0, 1 or 2, the corresponding nuclear statistical weights being 
1, 3 or 5; see Quantum Mechanics, § 105, Problem 5. For any given value of J 
there are definite numbers of states corresponding to values of the total nuclear 
spin. The following table gives these numbers for the first five values of J. 


Nuclear spin 0 1 2 
J=0 — — 1 
1 -— 1 — 

2 2 1 — 

3 —_ 2 1 

4 2 2 1 


The value of the sum Z,,, which is obtained by taking into account the total 
degree of degeneracy with respect to orientations of the rotational angular mo- 
Mentum and nuclear spin must be divided by 16 if the entropy is to be measured 
from the value log (2/+-1)* = log 16 (cf. the first footnote to § 48). The result is 


- $5 9 25 77 _. 117 
= 9 AIT LT ep -8AS/IT Lg — BAIT 1, gp —10K8/IT 
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§ 52. Magnetism of gases 


A body in an external magnetic field H is characterised by a further macro- 
scopic quantity, the magnetic moment Yt which it acquires in the field. 
For an ideal gas, this is §t = Nim (where wt is the mean magnetic moment 
of an individual atom or molecule), and so its calculation calls for a consid- 
eration only of the behaviour of these individual particles of the gas in the 
magnetic field. Note also that, since the magnetisation of a rarefied medium 
such as a gas is small in proportion to its density, we can neglect the influence 
of the medium on the field, ie. assume that the field acting on each particle 
is just the external field H. 

The change in the Hamiltonian of the gas corresponding to a small change 
6H in the external field is 6 = —M-dH, where M is the operator of the 
magnetic moment of the gas. According to (15.11) (cf. also (11.4)), in which 
the external parameter A is now to be taken as the field H, we therefore have 


M = —(OF/OM)z yn. (52.1) 


In calculating the free energy of the gas in the magnetic field, we must 
first determine the corrections, due to the field, to the energy levels of the gas 
particles. Let us first assume that the gas is monatomic. The Hamiltonian of 
an atom in a magnetic field is 


H= Bo—-h-H+ 


—_ 2, (HX Iq), (52.2) 


where Ay is the Hamiltonian of the atom in the absence of the field, e and m 
the electron charge and mass, r, the electron coordinates (the summation 
being over all the electrons), m =—f(2S+L) is the operator of the “in- 
trinsic” magnetic moment of the atom (§ and L being its spin and orbital 
angular momentum operators), and 6 = |e|#/2mc is the Bohr magneton 
(see Quantum Mechanics, § 113). Regarding the second and third terms in 
(52.2) as a small perturbation relative to Ho, we can determine the correction 
to the energy levels as far as the terms quadratic in the field. It is 


Ae, = &—& = —A,H—+B,H’, (52.3) 


+ In classical mechanics, the small change in the Lagrangian of a system of parti- 
cles due to a change 6H in the field is dL = M(q, g)-OH, where M(q, g) is the mag- 
netic moment of the system as a function of its dynamical variables, the coordinates 
and velocities: see Fields, (45.3). The change in the Hamiltonian for given coordi- 
nates q and momenta p differs from dL only in sign (see Mechanics, (40.7)): 6H = 
—M(g, p)-OH. In quantum mechanics there is a corresponding expression for 
the change in the Hamiltonian, M being the magnetic moment operator expressed 
in terms of the coordinates and momentum operators (and spins) of the particles. 
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where 
Ax = (Me)kks (52.4) 
|(ms ke? 
3B, =2y)——_- — Ft Paks 52.5 
k >» ame? XL +Yakk (52.5) 


and the z-axis is along H; the first term in (52.5) results in the second order 
of perturbation theory from the term linear in H in (52.2), and the second 
term in the first order, from the quadratic term in the Hamiltonian. 

In calculating the free energy, we shall assume that the gas temperature 
is not too low, so that 4e, < T. Then the partition function can be expanded 
in powers of H, and as far as the quadratic terms we have 


| = Sel = Sekt | Ae, ARH” | BH? 
Z= ye ue j1+ T + TE oT I 


The summation over k includes in particular an averaging over directions of 
the intrinsic magnetic moment mt of the atom (on which the unperturbed 
levels do not depend); from symmetry it is evident that the mean value A is 
zero, leaving 


= H? Wi — O/T 
z= {147 (Fra) pee”, 
where the bar denotes averaging over the Boltzmann distribution unper- 
turbed by the field. Substituting this expression in (41.4) and then differen- 


tiating the free energy with respect to H, we obtain the magnetic moment in 
the form 9 = NyH, where 


= 
= pAt+B (52.6) 


is the molecular magnetic susceptibility of the gas (J. H. Van Vleck, 1927). 
Let us consider some particular cases. 

We shall suppose that the temperature T is small in comparison with the 
interval between the ground state and even the nearest excited level (including 
the fine-structure components of the ground state). Then we can say that 


only the ground state of the atom (k = 0) contributes to the mean values A 
and B. 

In the simplest case, where the atom (in the ground state) has neither spin 
nor orbital angular momentum (for example, the atoms of the inert gases), 
all matrix elements of the intrinsic magnetic moment of the atom are also 
zero. Then Ao = 0, and only the second term in Bo is non-zero. Because the 
wave function of a state with L = S = 0 is spherically symmetrical, the 
diagonal matrix elements (i.e. the mean values with respect to the state of 
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the atom) are (x2)oo = (Yz)o0 = 4(77)oo- The result is 


1=- émct 2 adoo» (52.7) 


i.e. the gas is diamagnetic and its susceptibility is independent of the temper- 
ature (P. Langevin, 1905).* 

However, if the intrinsic magnetic moment of the atom is not zero, then 
Ao # 0 and (with the assumption made above about the temperature) the 
first term. in (52.6) is large compared with the second term. A calculation 
according to the definition (52. 4) gives 


Ao=—PgMy, g= please). 


where g is the Landé factor and M, the projection of the total angular 
momentum J of the atom (see Quantum Mechanics, § 113). The averaging in 
(52,6) is equivalent to averaging over the values of M,. Since 


oe J 1 
Mj} = ~—— 2d  Ma=ZIUtD, 
we find 


=FE Tey. (52.8) 


Thus the gas is paramagnetic and its susceptibility follows Curie’s law, 
being inversely proportional to the temperature (P. Langevin, 1905). 


t Note that this diamagnetism, already mentioned in Quantum Mechanics, 
§ 113, is a quantum effect. Although the quantum constant does not appear expli- 
citly in (52.7), it in fact determines the “size” of the atom. In classical statistics, 
there are no macroscopic magnetic properties of matter, since in classical mecha- 
nics the Hamiltonian function of a system in a magnetic field differs from that in 
the absence of the field only in that the particle momenta pare replaced by P— 
eA(r)/c, where P are the generalised momenta and A(r) is the vector potential 
of the field. In the partition function, the integration is over all momenta P (and 
coordinates r). After a change of variables, with integration over p= P—eA/c 
instead of over P, we find that the partition function does not involve the magnetic 
field, and this therefore does not appear in any of the thermodynamic quantities: 


+ Formula (52.8) can be applied not only to a gas, but also to a solid or a liquid 
where the magnetic moments of the atoms can for any reason be regarded as 
“free”. This is true, for instance, of the magnetism of rare-earth elements in solid 
salts and in solutions. The paramagnetism of these ions is due to the incomplete 
4f shell. These relatively deep electrons are screened from the influence of neigh- 
bouring atoms by electrons further out, and the ions can therefore behave mag- 
netically like atoms of a rarefied gas. 
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"If the orbital angular momentum and the spin of the atom are not zero, 

but equal-in magnitude (L = S # 0), and combine to give a total angular 
momentum J = 0, then the diagonal matrix elements of the intrinsic mag- 
netic moment are zero, whereas the non-diagonal ones (for transitions 
L, 8S, J + L, S, J+1 within a multiplet) are not zero. Then Ao = 0, and the 
second (diamagnetic) term in Bo (52.5) is small compared with the first term, 
whose denominators contain the relatively small spacings of the ground-state 
fine structure. Here By > 0: for the ground state in each term of the sum 
over k’, both the numerator and the denominator are positive. Thus in this 
case the gas is paramagnetic and its susceptibility is independent of the tem- 
perature: y = Bo (J. H. Van Vleck, 1928).* 
_ The magnetic susceptibility of molecular gases may be calculated similarly. 
At ordinary temperatures the rotation of the molecules is classical. The 
matrix elements of the magnetic moment may therefore be calculated first 
for fixed nuclei, and the averaging over orientations of the molecule then 
carried out as if it were a rigid classical magnetic dipole (see the following 
Problems).* 


PROBLEMS 


PROBLEM 1. Determine the magnetic susceptibility of a monatomic gas when 
the fine-structure spacing of the ground state of the atom is small in comparison 
with 7. 


SOLUTION. In this case the averaging in (52.6) must be taken over all the com- 
ponents of the ground state multiplet of the atom, and the Boltzmann factors 
exp (—e/T) may be taken as equal for all components. Then 


Ae = |(JM,|m,| JM;)|*, 


where the averaging is over all values of J and M, (for given values of S and L). 
The result of such an averaging, however, does not depend on whether it is carried 
out before or after the angular momenta Sand L are added to give J. It may there- 
fore be calculated as 


( A® = |(MpMgs|m,| MpMsg) |? 
with independent averaging over M;, and Ms. Since 
MsM, = My M, =0, M2= FS(S+1), MP = $L(L+1), 


we find = 
At = PU4S(S + 1)+-L(L+ 1]. 


' t This case occurs for the Eu++* ions in europium salts (see the previous foot- 
note). 

t The magnetic moment arising from the motion of the nuclei is very small in 
comparison with: the electronic manent moment, and may therefore always be 
neglected. 
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In the expression (52.5) for A, the second term may be neglected. The first term 
(which might be large because its denominators, the multiplet spacings, are small) 
gives zero on averaging over the components of the multiplet: in the sum 


, My \m,) J'M7)? 
py eP—e) ; 


which is now taken over all numbers J, J’, M,, Mj, the terms which differ by the 
interchange of J and J’ cancel. The susceptibility is therefore 


1= c [4S(S+1)+L(L + 1)]. 


PROBLEM 2, Determine the magnetic susceptibility of a diatomic gas when the 
fine-structure spacings of the electron ground state of the molecule are large in. 
comparison with 7.t 


SoLuTion. In this case it is sufficient to consider only the ground level of the 
molecule, i.e. the lowest component of the ground-state multiplet. The mean 
value of the magnetic moment of the molecule in a state where the projections of 
the orbital angular momentum and the spin on the axis of the molecule are 4 
and is 


(AE | m| AZ) = —fa( +25), 


where n is the unit vector along the axis of the molecule. For classical rotation, 
e= i and the magnetic susceptibility is 


Aap 
i= ap (At 2eP. 


PROBLEM 3. The same as Problem 2, but with the fine-structure spacings small 
in comparison with T (molecular term case b). 


SOLUTION. In this case the averaging must be taken over all components of the 
multiplet. The diagonal matrix elements of the z-component of the magnetic 
moment for given values of A and the spin z-component Msg are 


(AMg [m,| AMs) = —B(n,A+2Ms). 


Averaging the square of this over the values of Mg, and the directions of n, we find 
the susceptibility 


iar 
= SalA+4s(S+ NL 


PROBLEM 4. Determine the magnetic susceptibility of the gas NO. The electron 
ground term of the molecule is 2/7 (ie. A = 1, S = $),and the spacing A between 


* At ordinary temperatures the multiplet spacings are certainly large in compari- 
son.with the rotational structure of the levels, so that the molecular term belongs 
to case a coupling (see Quantum Mechanics, § 83). 
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the doublet components is comparable’ to the temperature T (J. H. Van Vleck, 
1928), 


SOLUTION. Here, in the averaging in (52.6), we have to take account of both 
components of the doublet level with different Boltzmann factors. The diagonal 
matrix elements of the magnetic moment for the two states | 12’) are 


(1, —4{L+28|1, —4) = 1n—2-$n = 0, 
(1, $/L+28| 1, +) = 2n. 
Hence 


Wi wd 46? e— UT 
~ 3 the 4 


The operator £ has non-zero matrix elements for transitions between these two 
states, since »' changes in the transition but not 4. The non-diagonal matrix ele- 
ments of the operator 28, are 


(1,4(25,11, —2)= U1, 425,11, 4) =—1-sin 9, 


where @ is the angle between n and the z-axis.+ According to (52.5) (where we 
again neglect the second term), 


the factor 2 coming from the averaging of sin? 9. The complete expression for the 
susceptibility becomes 
6? 


=F f(4/T), f@)= 


4[1—e-*(1—x)] 
L= oF > 


x(1+e7*) 


t It is 180°. The lower and upper doublet components correspond to axial 


spin components ' = —} and +4 respectively. The term is case a. 


t The operator § = +o, where @ represents the Pauli matrices with quantisation 
along the axis of the molecule, i.e. 


Ga) 
O-= 
e™ \o ~1 


if the coordinates are &, 7, ¢ with the C-axis along n. 


CHAPTER V 


THE FERMI AND BOSE DISTRIBUTIONS 


§ 53, The Fermi distribution 


Ir THE temperature of an ideal gas (at a given density) is sufficiently low, 
Boltzmann statistics becomes inapplicable, and a different statistics must be 
devised, in which the mean occupation numbers of the various quantum 
states of particles are not assumed small. 

This statistics, however, differs according to the type of wave functions by 
which the gas is described when regarded as a system of N identical particles. 
These functions must be either antisymmetrical or symmetrical with respect 
to interchanges of any pair of particles, the former case occurring for particles 
with half-integral spin, and the latter case for those with integral spin. 

For a system of particles described by antisymmetrical wave. functions, 
Pauli’s principle applies: in each quantum state there cannot simultaneously 
be more than one particle. The statistics based on this principle is called 
Fermi statistics, or Fermi-Dirac statistics." 

As in § 37, we shall apply the Gibbs distribution to the set of all particles 
in the gas which are in a given quantum state; as already mentioned in § 37, 
this may be done even if there is an exchange interaction between the par- 
ticles. We again denote by 2, the thermodynamic potential of this set of 
particles; by the general formula (35.3), 


Qy = —T log Y (eu—aT ye, . (53.1) 


since the energy of n, particles in the kth state is just ,e,. According to 
Pauli’s principle, the occupation numbers of each state can take only the 
values 0 and 1. Hence 


Q, =-T log a+ sory, 


Since the mean number of particles in the system is equal to minus the 
derivative of the potential 2 with respect to the chemical potential u, the 


tT It was proposed by EB. Fermi (1926) for electrons, and its relation to quantum 
mechanics was elucidated by P. A. M. Dirac (1926). : 
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required mean number of particles in the kth quantum state is here obtained 
as the derivative 


act. 02; = et eg)T 
(Sy = Tama 
or finally 
= I 
h= er ° (53.2) 


This is the distribution function for an ideal gas obeying Fermi statistics, 
or in short a Fermi gas. All the n, <1, as is to be expected. When 
et—W/T < 1, formula (53.2) tends to the Boltzmann distribution function, as 
it should. 

The Fermi distribution is normalised by the condition 


1 
y amore = (53.3) 
where N is the total number of particles in the gas. This equation implicitly 
determines the chemical potential as a function of T and N. 
The thermodynamic potential 2 of the gas as a whole is obtained by sum- 
mation of 2, over all quantum states: 


= any, log (1 +e%-®)T), (53.4) 


§ 54. The Bose distribution 


Let us now consider the statistics obeyed by an ideal gas consisting of 
particles described by symmetrical wave functions, namely Bose statistics or 
Bose-Einstein statistics.* 

The occupation numbers of the quantum states when the wave functions 
are symmetrical are unrestricted and can take any values. The distribution 
function may be derived as in § 53; we put 


Qy =—Tlog 3° (eu-wTY, 
0 


a= 
This geometric progression is convergent only if e“—/7 < 1, Since 


this condition must be satisfied for all e,, including ¢, = 0, it is clear that 


we must certainly have 
uw<0. (54.1) 


t This was introduced by S. N. Bose (1924) for light quanta, and generalised by 
Einstein. 
12 
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In this connection it may be recalled that for a Boltzmann gas the chem- 
ical potential always has large negative values; for a Fermi gas 4 may 
be either negative or positive. 

Summation of the geometric progression gives 


Q, = T log (1 —e@—/T), 


Hence we find the mean occupation numbers n, = —62,/du: 
=e 1 54 
= ee wF aT Cr) 


This is the distribution function for an ideal gas which obeys Bose statistics 
(or, as it is called for brevity, a Bose gas). It differs from the Fermi distri- 
bution function in the sign of unity in the denominator. Like that function, 
it tends of course to the Boltzmann distribution function when e“—/T << 1, 
The total number of particles in the gas is given by the formula 


1 
N=) ay (54.3) 


and the thermodynamic potential 2 of the gas as a whole is obtained by 
summation of 2, over all quantum states: 


Q = TY log (1-7), (54.4) 


§ 55. Fermi and Bose gases not in equilibrium 


As in § 40, we can calculate the entropy also for Fermi and Bose gases not in 
equilibrium, and again derive the Fermi and Bose distribution functions 
from the condition that the entropy be a maximum. 

In the Fermi case there can be no more than one particle in each quantum 
state, but the numbers N, are not small, and are in general of the same order 
of magnitude as the numbers G,. (The notation is as in § 40.) 

The number of possible ways of distributing N, identical particles among 
G, states with not more than one particle in each is just the number of ways 
of selecting N, of the G; states, i.e. the number of combinations of G, things 
N, at a time. Thus 

Ar, = G;!/N;! (G,;—N);)!. (55.1) 


Taking the logarithm of this expression and using for the logarithm of 
each factorial the formula (40.3), we find 


sa 2G; log Gj—Nj log Nj—(G;—Nj) log (Gj—Np}. (55.2) 
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Again using the mean occupation numbers of the quantum states, n, = N,/G), 
we finally have the following expression for the entropy of a Fermi gas not in 
equilibrium: 

S= => G,{nj log nj + (1 —n)) log (1—1n))). (55.3) 


From the condition for this expression to be a maximum according to 
(40.8) we easily find that the equilibrium distribution is given by the formula 


ny = 1/(er+4 +1), 


which is the Fermi distribution, as it should be. 

Finally, for Bose statistics, each quantum state may contain any number of 
particles, so that the statistical weight AI’, is the total number of ways of 
distributing N, particles among G;, states. This number is’ 


AD, = (Gj; +Nj—I'YM(Gj- D1 Nj! (55.4) 


Taking the logarithm of this expression and neglecting unity in comparison 
with the very large numbers G, +N, and G,, we obtain 


S= >, {(G; +N)) log (G,; +N;)—N; log N;—G; log G;}. (55.5) 


In terms of the numbers n, we can write the entropy of a Bose gas not in 
equilibrium as 
S= 1 Gl + nj) log (1 +-j))—n; log nj). (55.6) 


It is easily seen that the condition for this expression to be a maximum in 
fact gives the Bose distribution. 

The two formulae (55.2) and (55.5) for the entropy naturally tend, in the 
limiting case N, < G,, to the Boltzmann formula (40.4), and the statistical 
weights (55.1) and (55.4) for Fermi and Bose statistics tend to the Boltzmann 
expression (40.2); to see this, we must put G,! = (G,—N,)!GM", (G, +N,—1)! 


tf The problem is to find the number of ways of distributing N;, identical balls 
among G;, urns. Let us imagine the balls as a line of N, points, and number the urns; 
let us then imagine the latter to be separated by G,—1 vertical strokes placed at 
intervals along the line of points. For example, the diagram 


A) apace ll cee lass 


represents ten balls distributed among five urns: one in the first, three in the second, 
none in the third, four in the fourth and two in the fifth. The total number of 
places (occupied by points and strokes) in the line is Gj+N,—1. The required 
number of distributions of the balls among the urns is the number of ways of 
choosing G;—1 positions for the strokes, i.e. the number of combinations of N + 
G,—1 things G,—1 at a time, and this gives the result (55.4). 
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zz (G,—1)!G). It must be remembered, however, that, in going to the limit, 
terms of order N}/G, are neglected in the statistical weights, and these terms 
are not in general small; but when the logarithm is taken these terms give a 
correction to the entropy which is of the relatively small order NIG; 

Finally, we shall give a formula for the entropy of a Bose gas in the impor- 
tant limiting case where the number of particles in each quantum state is 
large (so that N, >> G,, n, >> 1). We know from quantum mechanics that this 
case corresponds to the classical wave picture of the field. The statistical 
weight (55.4) becomes 


AT, = Ny"(Gj—1)! - (55.7) 
and the entropy is ; . 
S = 2G; log (eN,/G)). : (55.8) 


We shall make use of this formula in § 71. 


§ 56. Fermi and Bose gases of elementary particles 


Let us consider a gas consisting of elementary particles, or of particles 
which under certain conditions may be regarded as elementary. As has 
already been mentioned, the Fermi or Bose distribution need not be used for 
ordinary atomic or molecular gases, since these gases are in practice always 
described with sufficient accuracy by the Boltzmann distribution. 

All the formulae derived in the present section are exactly similar in form 
for both Fermi and Bose statistics, differing only as regards one sign. The 
upper sign will always correspond to Fermi statistics and the lower sign to 
Bose statistics. 

The energy of an elementary particle is just the kinetic energy of its transla- 
tional motion, which is always quasi-classical. We therefore have 


e = (p2+p}+p2/2m, me (56.1) 


and in the distribution function we make the usual change to the distribution 
in the phase‘space of the particle. Here it must be borne in mind that, for a 
given value of the momentum, the state of the particle still depends on the 
orientation of its spin. Hence the number of particles in a volume element 
dp, dp, dp, dV in phase space is found by multiplying the distribution (53.2) 
or (54.2) by 

g dt = g dp, dp, dp, dV /(2xh)*, 


where g = 2s-+1 (s being the spin of the particle), giving 


g dt | | 
dN = ee-wIT ET . : Ho ash (56.2) 
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Integrating over V (which simply involves replacing dV by the total volume 
V of the gas) we find the distribution for the components of the particle 
momentum, using spherical polar coordinates in momentum space and 
integrating over angles, we find the distribution for the absolute magnitude 
of the momentum: .. 


_ __— gp dp 
ON = SRR ET)” (56.3) 


where ¢ = p?/2m, or the energy distribution 


geVm2 ede 


ONe= a Se-WT ET 


(56.4) 


These formulae take the place of the classical Maxwellian distribution. 
Integrating (56.4) with respect to «, we obtain the total number of particles 
in the gas: 


Ne eV? ( Ve de 
: QU2772h3 eG IT + | . 


0 


In terms of a new variable of integration z = e/T, this equation can be written 


N _ g(mT)? /z dz 
YV  DiR7 23 er—HIT 4] (08.9) 
This formula implicitly determines the chemical potential uw of the gas as a 
function of.its temperature J and density N/V. 

With the same change from summation to integration in formulae (53.4), 
(54.4), we find for the potential 2 the expression 


VeTmi? 
Q= ae [ Vlog (1te—T) de. 


Integration by parts gives 


2 gVmi? ( eF2 de 


3 Dia 2hs. eT 44° ES) 


Q= 


This expression is the same, apart from the factor — 2, as the total energy of 
the gas, 


oo oo 


3/2 3/2 
b= [ ean, = Fa | e3/8 de 


Dee | OTL et) 
8 
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Since 2 = — PV, we have therefore 
PY = 2E. (56.8) 


This result is exact, and so must hold good in the limiting case of a Boltz- 
mann gas also; and in fact, on substituting the Boltzmann value E = 3NT/2, 
we obtain Clapeyron’s equation. 

From formula (56.6), substituting «/T = z, we obtain 


Q =—PV = VT*2f(u/T), (56.9) 


where fis a function of a single variable, i.e. 2/V is a homogeneous function 
of order = in uw and T.* Hence 


epee aah a ee ele 
r~-r(ar),, ™ 77-7 (ar)ey 


are homogeneous functions of order = in w and 7, and their ratio S/N isa 
homogeneous function of order zero, i.e. S/N = $(u/T). Hence we see that 
in an adiabatic process (S = constant) the ratio u/T remains constant, and 
since N/VT®* is also a function of /T only we have 


VT? = constant. (56.10) 
Then (56.9) shows that 
Py 5 = constant, (56.11) 


and also T°?/P = constant. These equations are the same as that of the 
Poisson adiabatic (43.9) for an ordinary monatomic gas, but it must be 
emphasised that the exponents in (56.10), (56.11) are here unrelated to the 
ratio of specific heats, since the relations c,/c, = = and c,—c, = 1 are 
not valid. 
Formula (56.6), in the form 

go QU BAT 5/2 232 dz 

P= 30H 7 Cel) 


4 


0 


together with (56.5) determines the equation of state of the gas (in parametric 
form, with parameter 1), i.e. the relation between P, V and T. In the limiting 
case of a Boltzmann gas (e”’7 < 1) these formulae give Clapeyron’s equa- 


? If the energy is calculated from (56.9) as 
E= Nw+TS—PV = —wdQ/op—T 8Q/0T+Q, 


we again obtain (56.8). 
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tion, as they should. We shall show this by means of a calculation which also 
gives the first correction term in the expansion in the equation of state. 

For e“/T < 1 we expand the integrand in (56.12) as a series of powers of 
et/T—2 and, retaining only the first two terms, obtain 


es 
0 


| 25? dz a Z3!2gulT—2( 1 + eviT—2) dz 
3 


ar vier" (1 a a et a). 


Substitution in (56.12) gives 


7 ~ Si 


If only the first term of the expansion is retained, we obtain precisely the 
Boltzmann value of the chemical potential of a monatomic gas (formula 
(46.1a)). The next term gives the required correction, so that we can put 


(2 
eur (56.13) 


But the small additions to all the thermodynamic potentials (expressed in 
terms of the appropriate variables; see (24.16)) are the same. Hence, express- 
ing the correction term in 2 in terms of T and V (which can be done to the 
same accuracy by means of the Boltzmann expressions), we obtain the 
correction to the free‘energy: 

3/2 N23 


‘TTR . (56.14) 


Fe Feo t >> 


Finally, differentiating with respect to volume, we obtain the required equa- 
tion of state: 
a = Nf ene), 


2s Wei (56.15) 


PV = nr 125 


The condition for the correction term in this formula to be small is naturally 
the same as the condition (45.6) for Boltzmann statistics to be applicable. 
Thus we see that the deviations of an ideal gas from classical properties, 
occurring when the temperature is lowered at constant density (the gas then 
being said to become degenerate), cause in Fermi statistics an increase in 
pressure as compared with its value in an ordinary gas; we may say that in 
this case the quantum exchange effects lead to the occurrence of an additional 
effective repulsion between the particles. 
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In. Bose statistics, on the other hand, the value of the gas pressure:changes 
in the opposite direction, becoming less than the classical value; we:may say 
that here there is an effective attraction between the particles. 


§ 57. A degenerate electron gas 


The study of the properties of a Fermi gas at sufficiently low temperatures 
is of fundamental significance. As we shall see below, the temperatures con- 
cerned may in practice be very high in other respects. 

In what follows we shall discuss an electron gas, with a view to the most 
important applications of Fermi statistics, and accordingly put g = 2 (the 
spin s = +). 

Let us first consider an electron gas at a temperature of absolute zero (a 
completely degenerate Fermi gas). In such a gas, the electrons will be distrib- 
uted among the the various quantum states so that the total energy of the gas 
has its least possible value. Since no more than one electron can be in each 
quantum state, the electrons occupy all states with energies from the least 
value (zero) to some greatest value which depends on the number of electrons 
in the gas. 

With the twofold (g = 2) spin degeneracy of the levels, the number of 
quantum states of an electron moving in a volume V with absolute magnitude 
of momentum in the interval from p to p +dp is 


| Anp?>dp-V ___, p* dp 
(QnhP = = ah * a) 


The electrons occupy all states with momenta from zero to a limit p = pr, 
called the radius of the Fermi sphere in momentum space. The total number 
of electrons in these states is 


whence the limiting momentum is given by 
pr = (32")'3 (N/V)4 A, (57.2) 
and the limiting energy by 
ep = ph/2m . 
= (32)? (h2/2m) (N/V PF. | (57.3) 


This energy has a simple thermodynamic significance. In accordance with 
the foregoing discussion, the Fermi distribution function over quantum 
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states (with fixed values of the momentum p and the spin component) is 


= \ 


My = ron ais 7 (57.4) 


and in the limit T -- 0 it becomes a step function, equal to unity for e < u 
and zero for « > yu (the continuous line in Fig. 6). Hence we see that the 


chemical potential of the gas for T = 0 is the same as the limiting energy of 
the electrons: 
pb = ep. (57.5) 


The total energy of the gas is obtained by multiplying the number of states 
(57.1) by p?/2m and integrating over all momenta: 
up 
et wg OU. 
2mrP7h3 10m27h 


0 


E= 


or, substituting (57.2), 


E= (57.6) 


10 m\V 


2\2/3 2 2/3 
3(322) # (N) Be 


Finally, from the general relation (56.8), we find the equation of state of the 
gas: 


P= 


Bryer he (7 )". 


5 m\V oi) 


Thus the pressure of a Fermi gas at absolute zero is proportional to the 4 
power of its density. 

Formulae (57.6), (57.7) are approximately valid also at temperatures which 
are sufficiently close to absolute zero (for a given gas density). The condition 
for them to be applicable (for the gas to be “strongly degenerate”) is clearly 
that T should be small in comparison with the limiting energy ¢;: 


T <« (f2/m)(N/VY*S. (57.8) 


168 The Fermi and Bose Distributions 


‘This condition is, as we should expect, the opposite of the condition (45.6) for 
Boltzmann statistics to be valid. The temperature defined by the relation 
Tr & pis called the degeneracy temperature. 

A degenerate electron gas has the peculiar property that it increasingly 
approaches the ideal gas state as its density increases. This is easily seen as 
follows. 

Let us consider a plasma, i.e. a gas consisting of electrons and a correspond- 
ing number of positively charged nuclei which balance the charge on the 
electrons; a gas composed of electrons alone would obviously be entirely 
unstable, but we have not mentioned the nuclei hitherto, because the assump- 
tion of ideal-gas properties means that the presence of the nuclei does not 
affect the thermodynamic quantities for the electron gas. The energy (per 
electron) of the Coulomb interaction between the electrons and the nuclei is 
of the order of Ze*/a, where Ze is the nuclear charge and a ~ (ZV/N)** is 
the mean distance between the electrons and the nuclei. The condition for an 
ideal gas is that this energy should be small compared with the mean kinetic 
energy of the electrons, which in order of magnitude is equal to the limiting 
energy e,. The inequality Ze*/a « e,, after the substitution of a ~ (ZV/N)'* 
and the expression (57.3) for ep, gives the condition 


NIV >> (e2m/#2) Z?. (57.9) 


We see that this condition is more nearly met as the density N/V of the gas 
increases.‘ 


PROBLEM 


Determine the number of collisions with a wall in an electron gas at absolute 
zero. 

SOLUTION. The number of electrons per unit volume with momenta in the inter- 
val dp at an angle to the normal to the wall in the interval dO is 2-27 sin 6 dO p? 
dp/(2xh)*. The required number of collisions » (per unit area of wall) is obtained 
by multiplying by v cos 9 (v = p/m) and integrating with respect to 0 from 0 to 
‘3% and with respect to p from 0 to pz. The result is 


sis 3(3a02)¥8 h N 4/3 
~ 16 =ey ; 


§ 58. The specific heat of a degenerate electron gas 


At temperatures which are low compared with the degeneracy temperature 
T;, the distribution function (57.4) has the form shown by the broken line in 
Fig. 6: it is appreciably different from unity or zero only in a narrow range of 


* The degeneracy temperature corresponding to the electron gas density 
(e2m/# 8 Z? is 40Z4/3 eV ~ 0.5 108 243 degrees, 
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values of the energy e close to the limiting energy e,. The width of this “tran- 
sition zone” of the Fermi distribution is of the order of T. 

The expressions (57.6), (57.7) are the first terms in the expansions of the 
corresponding quantities in powers of the small ratio 7/T,. Let us now 
determine the next terms in the expansions. 

Formula (56.6) involves an integral of the form 


I= f (2) de 
~ | e@-a/T 1? 
0 
where f(e) is a function such that the integral converges; in (56.6), f(e) = &°” 
We transform this sa by the substitution e—y = Tz: 


ef +i 

H/T 

=r{ fu=% FY pap | LEED oy 
741 er+] 


0 


In the first integral we put 1/(e~7 +1) = 1—1/(e* +1), obtaining 


is [rose r| tes fh Te) 7) dz+T o dz. 


In the second of these integrals we replace the upper limit by infinity, since 
u/T > 1 and the integral is rapidly ome This gives 


ae P fu+T2)—f(u—T2) 


We now expand the numerator of the second integrand as a Taylor series of 
powers of z and integrate term by term: 


= } fe desrmryp wy ( 2% 


+479") bee ae 


t This amounts to neglecting exponentially small terms. It must be remembered 
that the expansion (58.1) derived below is an asymptotic, not a convergent, series. 
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Substituting the valuest of the integrals, we have finally - 


ik 


r= { fe der TY Wt TY + SBA) 
5 


+ Integrals of this type are calculated as follows: 


oo tod 


Eas = [ z*—le—z y (—)te—"* dz = I(x) x (—yt 
e=+-] nad hel n* 


d 
= (1—21-*) I(x) oy = 


or 


oo 


[a = = = (1-2!) xox) (& > 0), 


where (x) = )° 1/n* is the Riemann zeta function. 
nal 


For x = 1, the expression becomes indeterminate; the value of the integral is 


oo 


dz 
e+] = log 2. 


Ld 


0 


For x an even integer (= 2n) the zeta function can be expressed in terms of the 
Bernoulli numbers B,,: 


= —._—— _ 1"B 


¢ 22n—1 dz 2u-1_1 
{ ez+1 2n oS 


The following integrals are calculated similarly: 


oo 


= =Twts) @>d. 
e7—J 

rd 

For x an even integer (= 2n), 


( geal dz omen B, 
e*¥—] 4n 


0 


For reference we shall give the values of the first few Bernoulli numbers and of 
some zeta and gamma f naa 


B, = 5 B, = ao B, = bs B, 
£(3) =: 2.612, C(3) = 1.341, £(3) = 1.202, 
C(5) = 1.037, I($)= baw, (8) = 2x. 
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The third term in the expansion is given for reference; it will not needed 
here. 

Putting f = «*” in formula (58.1) and substituting in (56.6), we obtain the 
required next term in the expansion of the potential 2 at low temperatures: 


3/2 
Q = Q)— yr Vee (58.2) 


where {29 denotes the value of 2 at absolute zero. 

Regarding the second term as a small correction to 2o and expressing pu in 
it in terms of T and V by means of the “zero-order approximation” (57.5), 
we can immediately write down an expression for the free energy (according 
to the theorem of small increments (24.16)): 


F = Fy—48NTUV/N)**, (58.3) 
using for brevity the notation 
\28 m 
B= (3) a (58.4) 
Hence we find the entropy 
S = BNTVINY*, (58.5) 
specific heat* 


C = BNT(VVINY*, (58.6) 
and energy of the gas: | 
E = Ey++4BNTUV/NY* 
= Eo{l +0.18(mT/h?)? (V/N)*}. (58.7) 


Thus the specific heat of a degenerate Fermi gas at low temperatures is 
proportional to the temperature. 


§ 59. Magnetism of an electron gas. Weak fields 


The magnetisation of an electron gas in weak magnetic fields is made up of 
two independent parts: a paramagnetic part due to the intrinsic (spin) 
magnetic moment of the electrons (the Pauli paramagnetism: W. Pauli, 
1927) and a diamagnetic part due to the quantisation of the orbital motion 
of the electrons in the magnetic field (the Landau diamagnetism: L. D. 


t The suffix v or p to the specific heat is omitted, since C, and C, are the same 
in this approximation. We have seen in § 23 that, if S tends to zero as T” when 
T - 0, the difference C,—C, tends to zero as T?"*1, and so inthis case C,—C, ~ T°. 
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Landau, 1930). Let us calculate the corresponding magnetic susceptibilities, 
assuming the gas to be degenerate (the temperature T « e,). The condition 
for the magnetic field to be weak is (see below) BH «T, where B = 
|e| #/2mc is the Bohr magneton.’ 

For a degenerate gas, the thermodynamic calculations are more conven- 
iently performed with 7,V and u as independent variables (instead of T, V 
and N). Accordingly, in place of (52.1) for the magnetic moment of a Boltz- 
mann gas we shall use the derivative 


Me = —(@2/dH)r, Vin (59.1) 


of the thermodynamic potential 2. 

Let us first find the paramagnetic part of the susceptibility. The additional 
(spin) energy of the electron in the magnetic field is BH, the two signs cor- 
responding to the two values +4 of the spin component along the field. The 
statistical distribution of electrons in the magnetic field consequently differs 
from that in the absence of the field in that the energy e = p?/2m is replaced 
by « = p?/2m+ 8H. But, since e appears in the distribution together with 
the chemical potential in the combination ¢—, this change is equivalent to 
replacing 4 by 48H. The potential 2 of an electron gas in a magnetic 
field may therefore be written 


QAp) = FQo(u+BH) +32Q0(u—BH), (59.2) 


where {29(14) is the potential in the absence of the field (the arguments T and 
V are omitted, for brevity); the two terms in (59.2) correspond to the sets 
of electrons with different spin components, and the factors + take account of 
the halving of the number of quantum states of the electron when the value 
of its spin component is fixed. 

Expanding (59.2) in powers of BH, we have 


Q(u) = Qolu) + $BPH? OQ u)/Ou*, (59.3) 


and the magnetic moment is therefore It = — Hf? 0202/Ou?. The derivative 
02o/Ou = —N, and the paramagnetic susceptibility, which in this section 
is taken relative to unit volume of the gas, is 


pie cc! Weg oe (an) (59.4) 
T,V 


t In the opposite case of high temperatures (T >> £,), the electrons form a Boltz- 
mann gas, and the paramagnetic part of the susceptibility per unit volume is 
‘para = NB?/VT (formula (52.8) with g = 2, J= $). 
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Neglecting the temperature effect, which is small when 7 « y, i.e. assuming 
the gas to be completely degenerate, we have from (57.3) 


_ zy (2mp)>? 
N=V yop? 
and differentiation gives 
2m)? 2pm 
Vieec= Pm = Pee (59.5) 


Let us now calculate the diamagnetic susceptibility. The energy levels of 
the orbital motion of an electron in a magnetic field are 


2 
a2 i +(2n+ BH, (59.6) 
where n = 0, 1, 2, ..., and p, is the momentum in the direction of the field, 
taking a continuous range of values from — oo to oo (see Quantum Mechanics, 
§ 112). The number of states in the interval dp, for any given value of n is 


Vie|H 


> Onhyrc 


dp., (59.7) 


the factor 2 taking account of the two directions of the spin. The expression 
(53.4) for the potential 2 is 


Q = 28H Se flu—(2n+1)BH], (59.8) 
f() = etl | log ! +exp Ca nr) | dp:. (59.9) 


—_OO 


The sum (59.8) can be calculated to the necessary accuracy by means of 
the formula’ 


oo 


y F(nt+4) & { F@ dx +34F’(0). (59.10) 
0 


n=0 
Tt According to the familiar Euler-Maclaurin sum formula, 
$F (a)+ x F(atn) = f Fo dx—i5 F(a) (59.10a) 
2 ‘ = i!" (a). : 
Am a 


Formula (59.10) is obtained from this by putting a = 4 and representing the func- 
tion F(x) in the range 0 <x <« i by F(x) = F(0)+-xF’(0). 
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The condition for this to be applicable is that the relative change of F in one 
step (nm - n+1) is small. In respect of the function (59.9), the condition is 
pH «Tt 

Applying (59.10) to the sum (59.8), we have 


Q = 28H ( fu 2pHx) dx SE re eee 


QBH)? Aflu) 
| f(a) dx CE 


The first term does not contain H, i.e. it is the potential 29(2) of the gas in the 
absence of the field. Thus 


a I O7Q20( 4) 
2 = Qu) ¢ BP (59.11) 
and hence the susceptibility ist 
_ BBQ 1 
Xdia = 37 On ae ~ 3 Xpara- (59.12) 


Altogether, the gas is paramagnetic, with susceptibility y = 2y,,,,/3. Here 
we have calculated its two parts separately in order to clarify their origin. 
It would, of course, be possible to calculate the total susceptibility y directly. 
To do so, we should have to write the electron energy levels in the form 
e = p2/2m+(2n+1)BH+ 6H, obtained by adding the spin magnetic energy 
+H to (59.6). This set of values of ¢ may also be written 


2 
e= 5+ InBH, n=0,1,2,..., (59.13) 


each value with 7 + 0 occurring twice, and that with n = 0 once; in other 
words, the number density of states with n + 0 is given by the same formula 


t Otherwise, the condition is violated in the “dangerous” range of values of n 
where fs —(2n+ 1)8H is close to zero. This range causes (see § 60) the occurrence in 
Q2 of terms that oscillate rapidly as functions of H. These terms disappear if the 
series (59.8) is averaged over a range 4H such that the change of the argument 
(s—2BnH near the point where it is zero is much greater than the difference of its 
two neighbouring values: 


pH «nBAH~ wpAH/H, or AH/H > BHlu. 
Formula (59.10) then becomes again valid, and the result given by it is subject 


only to the condition BH « p. 
t This relation is valid for any degree of degeneracy of the gas. 
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(59.7), and that with n = 0 by half the amount. The potential 2 is then given 
by the sum 


2 = anlar ¥ suru}, (59.14) 
and is calculated by means of the formulat 


1F(0)+ 3 F(n) = [ F(a) dx-3F'O. (59.15) 


§ 60. Magnetism of an electron gas. Strong fields 


Let us now consider fields for which 8H is again small in comparison with p, 
but need not be small in comparison with T: 


T SPH «uw. (60.1) 


Under these conditions, the effects of the quantisation of orbital motion 
cannot be separated from the spin effects, and both must be considered 
together; thus we must start from (59.14) in calculating Q. 

We shall see that the magnetisation of the electron gas for BH = T con- 
tains a part which oscillates with a large amplitude as a function of H, and it is 
this oscillatory part of the magnetisation that will concern us here. 

To separate the oscillatory parts of the thermodynamic quantities, it is 
convenient to transform the sum (59.14) by means of Poisson’s formula:t 


5F(0)+ F(n) = i F(x) dx+2re y f F(x)e2*tkx dx, (60.2) 
tm 0 =l 0 


after which it becomes 


TmV bad 
Q = Qu) + —aps Te py Ik (60.3) 


t Obtained from the Euler-Maclaurin formula by putting « = 0, 
t This formula follows from the equation 


y O(x—n) = y e2nikx. 


aim —oco k= —oo 


the sum of the delta functions on the left is a periodic function of x with unit 
period, and the sum on the right is the expansion of this function as a Fourier 
series. On multiplying the equation by any function F(x) and integrating over x 


from 0 to oo, we get equation (60.2); the integral J F(x) 6(x) dx, which is the term 
with n= 0, extended only over the region on one side of x = 0, gives IFO). 
13 
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where 


I, = —2pBH { | log i +exp ($- e-em dxdp., (60.4) 


—oo 


and {29() is the thermodynamic potential in the absence of the field. 
In the integrals I, we replace the variable x by « = p?/2m +2xBH. For the 
required oscillatory part of the integrals, denoted by I,, we have 


=- | | log | texp (=) exp (Grr) exp (—sagi) de dp... 


In the integral over p,, the important values are p?/2m ~ BH. The oscillatory 
part of the integral, however, comes from the range of values of ¢ near us 
(see below); the lower limit of integration over e is therefore taken as zero 
instead of p?/2m. 

The integration over p, is separable and is effected by means of the for- 
mulat 


bi 
—lapt aay, —in/4 
( e dp =e V3 : 
leaving 


I, = —e7tals eH { log [1 + e%—¥/T]elnteibt de. 
5 


k 


In this integral, we integrate twice by parts, and in the remaining integral 
make the change of variable (e—u)/T = §. Omitting the non-oscillatory part, 
we have 


_ VQm)(BHY" —— finku in) [fe inkT 
i, = oe =) ex (ar-az) { (an exp (Fr ®) dé. 


OO 


The lower limit — 4/7 of the integral over ¢ is replaced by — ©, since u >> T 
When BH 2 T, the range  ~ 1 predominates in the integral, i.e. the range 


t This is derived by rotating the path of integration in the complex p-plane: 
we put p = e7!*/ty and integrate over real u from — co to o, 
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of ¢ values near  (e—p ~ T). The integral is calculated from the formula‘ 


we 


C é fag _ 
| (ef +1)" eee sinh a * 


We finally have for the oscillatory part of 2 


6 = VAmBHYPTV S cos (mpk/BH—4n) 


ra) 2, EB? sinh (@RTBH) (60.5) 


In calculating the magnetic moment as the derivative of (60.5), only the 
most rapidly varying factors need be differentiated, namely the cosines in 
the numerators. This gives 


«A (2B)m2uTV = sin (xpk/BH—F2) 
N= SRV Qa Vesinh TBH) 9) 


(L. D. Landau, 1939). This function oscillates with high frequency. Its 
“period” in the variable 1/H is constant, 


A(1/H) = 2B/u, (60.7) 


independent of the temperature. Here 4H/H ~ BH/u « 1.4 
When BH ~ T, the amplitude of oscillation of the magnetic moment 


M ~ VeH#"(mB)? k-°. The “monotonic” part I of the magnetisation, 
determined from the susceptibility calculated in§ 59, is M ~ Vu? Hm®"p74-%. 


Hence De/ aM ~ (u/BH)'"; the amplitude of the oscillating part is large 
compared with the monotonic part. However, if BH « T, this amplitude is 
exponentially small, as exp (—2?7/8H), and becomes negligible. 


t By the substitution wu = 1/(e*+1), the integral is reduced to a beta function: 
1 
J Q—uy* u-* du = PA +ie) Pia) TQ) 
0 
and the result given in the text follows, since 


Fa—2z) FA+2) = 22/sinzz. 


t The oscillations of the magnetisation were qualitatively predicted by Landau 
(1930). In metals, this phenomenon is called the de Haas—-van Alphen effect. 
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§ 61. A relativistic degenerate electron gas 


As the gas is compressed, the mean energy of the electrons increases (e, 
increases); when it becomes comparable with mc?, relativistic effects begin to 
be important. Here we shall discuss in detail a completely degenerate extreme 
relativistic electron gas, the energy of whose particles is large compared with 
mc*. In this case the relation between the energy and momentum of a particle 
is 

e= cp. (61.1) 

The previous formulae (57.1) and (57.2) give the number of quantum states 


and hence the limiting momentum. The limiting energy (i.e. the chemical 
potential of the gas) is now . 


N 1/3 
er = cpr = (3n)¥8 fe(F | (61.2) 
The total energy of the gas is 
y ‘ 
c 3 _ CDp 
= Says [ dp = Fags 
or 
— 2 (428 NS 
E= 7 Gx) nen (7 (61.3) 


The gas pressure can be obtained by differentiating the energy with respect 
to the volume at constant entropy (equal to zero). This gives . 
_ E = 1 2)1/8 N\“ _ 
P= 35 = Gn") Ae(F | , (61.4) 
The pressure of an extreme relativistic electron gas is proportional to the 4/3 
power of the density. 
It should be mentioned that the relation 


PV = $E a (61.5) 


is actually valid for an extreme relativistic gas not only at absolute zero but 
at all temperatures. This is easily seen by exactly the same method as that 
used to derive the relation (56.8), with the energy given by e = cp instead of 
e = p*/2m. With e = cp, formula (53.4) leads to. the following expression 
for Q: 


= — IV ( 2 8) 
Q= wer | ° log (1 +e WP) de, 
0 
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or, integrating by parts, 


1 Vf ede 1 
O-—3 xem | aeooreT ~~ 3 Ot) 


Thus the limiting value that the pressure of any macroscopic body can have 
for a given E (see § 27) is reached for an extreme relativistic Fermi gas. 
Using the variable of integration z = ¢/T, we have 


Q=— saa | Se aT: 


VT! ( dz 
aOR | 
0 


This shows that 
Q = VT'f(u/T). (61.7) 


Hence, as in § 56, we find that in an adiabatic process the volume, pressure 
and temperature of an extreme relativistic Fermi gas are related by 


PY*3 = constant, WT?= constant, T4/P = constant. (61.8) 


These are the same as the usual equation of the Poisson adiabatic with 
y= 4; but it must be emphasised that y here is not the ratio of the specific 
heats of the gas. 


PROBLEMS 


PROBLEM 1. Determine the number of collisions with a wall in an extreme rela- 
tivistic completely degenerate electron gas. 


SOLUTION. The calculation is as in § 57, Problem; it must be remembered that 
the electron velocity v = c. The result is » = {cN/V. 


PROBLEM 2. Determine the specific heat of a degenerate extreme relativistic 
electron gas. 


SoLuTION. Applying the formula (58.1) to the integral in (61.6), we find 
(uT? 
Q = Q, ——-— V. 
°— 6a 
Hence'the entropy 
2 
s= 4 _yr=N 


(3222)28 : ( x) 


and the specific heat 
2/3 1/3 
3ch N 
PROBLEM 3. Determine the equation of state of a relativistic completely de- 
generate electron gas (the electron energy and momentum being related by e? = 
c8p?+ met). 
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SOLUTION. The previous formulae (57.1) and (57.2) give the number of states 
and the limiting momentum, and the total energy is 


Pr : 
[ p? / (mic? +p*) dp, 


¢ 
tv) 


Ve 


i= Op 


whence 
: ¥ 
E= Fr {pp 2ph-+- mc?) \/(pp-+ mic?) —(me}t sinh (pg/mo)}. 


The pressure P = —(OE/OV) 5.2 is 
c 


P= es Pr (5 Ph—nbe) / (pie + mc?) +(me)* sinh Crime) 


These formulae are conveniently put in parametric form, using as parameter the 
quantity § = 4 sinh! (p,/mc). Then 


N/V = (me/fi>-(1/322) sinh 46, 
P = (mic /32n%#) (} sinh E—$ sinh 3€+6), 
E/V = (mtc®/322#8) (sinh €—6). _ 


The chemical potential 4 of the gas (including the rest energy of the particle) is 
equal to the limiting energy ey = e(pz). It is related to the density by 


- = 1 (5 a met) = . 


§ 62. A degenerate Bose gas 


At low temperatures the properties of a Bose gas bear no resemblance to 
those of a Fermi gas. This is evident from the fact that for a Bose gas the 
state of lowest energy, occupied by the gas at T = 0, must be that with 
E = 0 (all the particles being in the quantum state with « = 0), whereas a 
Fermi gas has a non-zero energy at absolute zero. 7 

If the temperature of the gas is lowered at constant density N/V, the 
chemical potential u given by equation (56.5) (with the lower sign) will 
increase, i.e. its absolute magnitude will decrease (since py is negative). It 
reaches the value u = 0 at a temperature determined by the equation 


g(mT)9 (4/2 dz 
“aims | ee” 
0 


N 
77 (62.1) 
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The integral in (62.1) can be expressed in terms of the zeta function; see the 
second footnote to § 58. Denoting the required temperature by To, we obtain 
_ 3.31 # (7) 


To = a5 in (7 


(62.2) 
For T < To, equation (56.5) has no negative solutions, whereas in Bose sta- 
tistics the chemical potential must be negative at all temperatures. 

This apparent contradiction arises because under the conditions in ques- 
tion it is not legitimate to go from the summation in formula (54.3) to the 
integration in (56.5): in this process the first term in the sum (with e, = 0) is 
multiplied by ~/e = 0 and so disappears from the sum; but, as the tempera- 
ture decreases, more and more particles must occupy that state of lowest 
energy, until at 7 = O they are all in it. The mathematical effect of this is 
that, when the limit » -> 0 is taken in the sum (54.3), the sum of all the terms 
in the series except the first tends to a finite limit given by the integral (56.5), 
but the first term (with ¢, = 0) tends to infinity. Consequently, by letting u 
tend not to zero but to some small finite value, we can make this first term 
in the sum take the desired finite value. 

In reality, therefore, the situation for T < Tp is as follows. Particles with 
energy « > O are distributed according to formula (56.4) with u = 0: 


gemPV 4/ede 
dN, = “FIA RS Sir . (62.3) 


The total number of particles with energies « > 0 will thus be 


2 _ BV (mT? f a/z dz 
ai | on = et 


a 
0 


= N(T/To)*. 


The remaining 
Nemo = N[1—(T/To)*”] (62.4) 


particles are in the lowest state, i.e. have energy ¢ = 0.' The energy of the gas 
for T < Tois, of course, determined only by the particles with « > 0; putting 
tt = 0 in (56.7), we have 
pa SVMTYPT p28 de 
= “rapes | “ant 
9 


t The steady increase of particles in the state with ¢ = 0 is called Bose-Einstein 
condensation. It should be emphasised that this refers only to “condensation” in 
momentum space; no condensation actually occurs in the gas, of course. 
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This am reduces to ¢(5) (see the second footnote to § 58), and we obtain 
| E = 0.710NT (T/To)” sll 


= 0.128¢(m32752/43)V, (62.5) 
The specific heat is therefore 
. Cy = 5E/2T, ~ (62.6) 
i.e. is proportional to T°", Integration of the specific heat gives the entropy: 
ee = 5E/37, (62.7) 
and the free energy is | 
F = E-TS =—2E. (62.8) 


This is obvious, since for u = 0 
| F=@-—PV = Nut+Q = 2. 
The pressure is 


= —(OF/OV)r = 0.0851gm3?T 52/73, (62.9) 


We see that for T’ < To the pressure is proportional to T°” and is independent 
of the volume. This is the natural consequence of the fact that particles in a 
state with e = 0 have no momentum and make no contribution to the 
pressure. 

At the point 7 = 7» itself, all the above-mentioned thermodynamic quan- 
tities are continuous, but it may be shown that the derivative of the specific 
heat with respect to temperature is discontinuous there (see the Problem). 
The curve of the specific heat itself as a function of temperature has a change 
in slope at T = To, and the specific heat has its maximum value there (equal 
to 1.28 3N/2)." 


PROBLEM 
Determine the discontinuity of the derivative (OC,/07), at T = Tp). 


SOLUTION. To solve this problem we must determine the energy of the gas for 
small positive T—7,. The equation (56.5) is identical with 


gVm r 1 i 
N=N(T)+ ae | Foe a 1 s/s de. 


t Note, however, that this behaviour of the specific heat is the result of entirely 
neglecting the interaction of the gas particles; even a weak interaction changes the 
situation. 
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where N,(T) is given by (62.1). Expanding the integrand and using the fact that u 
is small near the point T = Tp, and therefore the important part of the integral 
arises from the region where ¢€ is small, we find that the integral is equal to 


" de 
ro | ene (1) 
0 


Substituting this value and then expressing ps in terms of N—No, we have 


ait (No—N\? 
b= Sa ( a 


_To the same accuracy we can write 
OE 3 02 3 3 


whence 
3 3707hf No—N\? 
E= Eyt > Not = Bans Noy) ’ 


where E, = E,(T) denotes the energy for . = 0, i.e, the function (62.5). The sec- 
ond derivative of the second term with respect to temperature will clearly give 
the required discontinuity. The result of the calculation is 


aC,\ _ 6s 1 ON\? ae | 
(sz), =~ aayt Nol ar) fe = —3.66.N/Tp. (2) 


The value of the derivative (OC,/07), for T= T)—0 is, from (62.5), +2.89N/To, 
and for T = T)+0 it is therefore —0.77N/Tp. 


§ 63. Black-body radiation 


The most important application of Bose statistics relates to electromagnetic 
radiation which is in thermal equilibrium—called black-body radiation. Such 
radiation may be regarded as a gas consisting of photons. The linearity 
of the equations of electrodynamics expresses the fact that photons do not 
interact with one another (the principle of superposition for the electro- 
magnetic field), so that the photon gas is an ideal gas. Because the angular 
momentum of the photons is integral, this gas obeys Bose statistics. 

If the radiation is not in a vacuum but in a material medium, the condition 
for an ideal photon gas requires also that the interaction between radiation 
and matter should be small. This condition is satisfied in gases throughout 
the radiation spectrum except for frequencies in the neighbourhood of ab- 
sorption lines of the material, but at high densities of matter it may be vio- 
lated except at very high temperatures. 
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It should be remembered that at least a small amount of matter must be 
present if thermal equilibrium is to be reached in the radiation, since the 
interaction between the photons themselves may be regarded as completely 
absent.t The mechanism by which equilibrium can be established consists in 
the absorption and emission of photons by matter. This results in an im- 
portant specific property of the photon gas: the number of photons N in it 
is variable, and not a given constant as in an ordinary gas. Thus N itself must be 
determined from the conditions of thermal equilibrium. From the condition 
that the free energy of the gas should be a minimum (for given T and V), we 
obtain as one of the necessary conditions 0F/ON = 0. Since (6F/ON)r » = pL; 
this gives 

p= 0, (63.1) 
i.e. the chemical potential of the photon gas is zero. 

The distribution of photons among the various quantum states with 
definite values of the momentum /k and energies « = Aw = fick (and defi- 
nite polarisations) is therefore given by formula (54.2) with u = 0: 


A, = 1f(etT—1). (63.2) 


This is called Planck’s distribution. 

Assuming that the volume is sufficiently large, we can make the usual 
change (see Fields,§ 52) from thediscrete tothe continuous distribution of eigen- 
frequencies of the radiation. The number of modes of oscillation for which 
the components of the wave vector k lie in the intervals d°k = dk, dk, dk, is 
V d’k/(2x)3, and the number of modes for which the absolute magnitude 
of the wave vector lies in the range dk is correspondingly V -4ak* dk/(2x)*. 
Using the frequency w = ck and multiplying by 2 (for the two independent 
directions of polarisation of the oscillations), we obtain the number of 
quantum states of photons with frequencies between w and w +dw: 


Vo? da/n?c°. (63.3) 


Multiplying the distribution (63.2) by this quantity, we find the number of 
photons in this frequency interval: 


Vs w* dw 


No = TS Gol]? 


(63.4) 


and a further multiplication by Aw gives the radiation energy in this segment 
of the spectrum: 

Vi widw 
We gor —T 


dE, = (63.5) 


+ Apart from the entirely negligible interaction (the scattering of light by light) 
which is due to the possible production of virtual electron—positron pairs. 
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This formula for the spectral energy distribution of black-body radiation is 


called Planck's formula.* In terms of the wavelength 4 = 2xc/«, it becomes 


l6n2%chV dA 
dé; = an “gankclTA __ | ° (63.6) 


At low frequencies (fA « T), formula (63.5) gives 
dz, = V(T/r?c*)w? do. (63.7) 


This is the Rayleigh-Jeans formula. It should be noticed that formula (63.7) 
does not contain the quantum constant 4, and can be derived by multiplying 
by T the number of modes (63.3); in this sense it corresponds to classical 
statistics, in which an energy T must correspond to each “vibrational degree 
of freedom”—the law of equipartition (§ 44). 

In the opposite limiting case of high frequencies (Aw >> T), formula (63.5) 


becomes 
dE, = V(ii/n*c?)we—%e'T dao. (63.8) 


This is Wien’s formula. 
Figure 7 shows a graph of the function x*/(e*—1), corresponding to the 
distribution (63.5). 


33 
e*-| 


Fic. 7 


The density of the spectral frequency distribution of the energy of black- 
body radiation, d£,,/dw, has a maximum at a frequency ,, given by 


Am, |T = 2.822. (63.9) 


+ The discovery of .this law by M. Planck (1900) was the foundation of the 
quantum theory. 
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Thus, when the temperature rises, the position of the maximum of the distri- 
bution is displaced towards higher frequencies in proportion to T (the 
displacement law).* 

Let us calculate the thermodynamic quantities for black-body radiation. 
For 4 = 0, the free energy is the same as 22 (since F = ®—-PV = Nu+Q). 
According to formula (54.4), in which we put 4 = 0 and change in the usual 
way (by means of (63.3)) from summation to integration, we obtain 


a V 2 _e- 
Pat ae | ° log (1—e-*'T) do. (63.10) 
With the new variable of integration x = fw/T, integration by parts gives 
T* ¢ x8dx 
BE ages } e—1° 


The integral is equal to 24/15 (see the second footnote to § 58). Thus 


F =—V-2T4/45(fic)? 
= —40VT#/3c. (63.11) 


If T is measured in degrees, the coefficient o (called the Stefan-Boltzmann 
constant) is 
ao = 27k*/60hFc? 
= 5.67X 1075 g/sec® deg*. (63.12) 


The entropy is 
S =—0F/0T = l60VT*/3c, (63.13) 


and is proportional to the cube of the temperature. The total radiation energy 
E= F4+TS is 
E = 40VT4/c = —3F. (63.14) 


This expression could, of course, be derived also by direct integration of the 
distribution (63.5). Thus the total energy of black-body radiation is propor- 
tional to the fourth power of the temperature. This is Boltzmann's law. 

For the specific heat of the radiation C, we have 


Cy = (BE/OT)y = 160TV |c. (63.15) 


t The wavelength distribution density d£,/d/ also has a maximum, but at a 
different value of the corresponding ratio: 27fic/T/,, = 4.965. Thus the maximum 
Am Of the wavelength distribution is displaced in inverse proportion to the temper- 
ature. 
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Finally, the pressure is 
P =—(0F/eV)r = 40T */3c, (63.16) 
PV = 4E. (63.17) 


Thus for a photon gas the same limiting value of the pressure is obtained as 
for an extreme relativistic electron gas (§ 61); this is as it should be, since the 
relation (63.17) is a direct consequence of the linear relation (e = cp) be- 
tween the energy and momentum of a particle. 

The total number of photons in black-body radiation is 


N= 


mcs | “eholT __] ~ maciys | ex—1° 
0 


Vv i cot deo VTS f x2dx 


The integral can be expressed in terms of ¢(3); see the second footnote to 


§ 58. Thus 
2¢(3) (T \$ T \ 
N==5 (ie) V = 0.244( 5. V. (63.18) 

In an adiabatic expansion (or compression) of the photon gas, the volume 
and temperature are related by VT? = constant. From (63.16), the pressure 
and volume are then related by PV “® = constant. A comparison with (61.8) 
shows that the equation of the adiabatic for a photon gas coincides (as we 
should expect) with that for an extreme relativistic gas. 

Let us consider a body in thermal equilibrium with black-body radiation 
around it. The body continually reflects and absorbs photons incident on it, 
and at the same time emits new ones, and in equilibrium all these processes 
balance in such a way that the distribution of photons in frequency and direc- 
tion remains unchanged on the average. 

Owing to the complete isotropy of the black-body radiation, each volume 
element emits a flux of energy uniformly in all directions. We use the notation 


€o(«) = eis dE. = ee. 

4naV do 4bc3(eho/T — 1) 

for the spectral density of black-body radiation per unit volume and unit 

solid angle. Then the energy flux density with frequencies in the interval do 

leaving each point and entering the solid angle element do is ceo(w) do dw. 

The radiation energy (with frequencies in dw) incident in unit time on unit 

area of the surface of the body at an angle 9 to the normal is therefore 
Céo(w) cos 6 do dw, do = 2x sin 6 dé. 

Let A(, 0) denote the absorbing power of the body as a function of the 
frequency and direction of incidence of the radiation; this quantity is defined 
as the fraction of the radiation energy incident on the surface of the body, in 
the given frequency interval, which is absorbed by the body, not including 
the radiation (if any) which passes through the body. Then the quantity of 


(63.19) 
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radiation absorbed per unit time and surface area will be 
ceo(w) A(w, 8) cos 6 do dw. (63.20) 
Let us assume that the body does not scatter radiation and is not fluores- 
cent, i.e. that the reflection occurs without change in the angle @ or in the 
frequency. We shall also suppose that the radiation does not pass through 
the body; in other words, all radiation not reflected is completely absorbed. 
Then the quantity of radiation (63.20) must be balanced by the radiation 
emitted by the body itself in the same directions at the same frequencies. 
Denoting by J(a, 6) da do the intensity of emission from unit area of the sur- 
face and equating it to the absorbed energy, we obtain 
J(@, 9) = ceo(w) A(a, 9) cos 8. (63.21) 
The functions J(@, 6) and A(a, 6) are, of course, different for different bodies, 
but we see that their ratio is independent of the properties of the body and is 
a universal function of frequency and direction: 
J(a, 9)/A(w, 9) = ceo(w) cos 6, 
which is determined by the energy distribution in the black-body radiation 
spectrum (at a temperature equal to that of the body). This is Kirchhoff’s law. 
If the body scatters radiation, Kirchhoff’s law can be formulated only in 
a more restricted way. Since in this case reflection occurs witha change in 
the angle 6, we can derive from the condition of equilibrium only the require- 
ment that the radiation (of a given frequency) absorbed from all directions 
should be equal to the total emission from the body in all directions: 
f J(@, 9) do = ceo(w) [ A(@, 9) cos 6 do. (63.22) 
The angle 0 also changes, in general, when radiation can pass through the 
body (because of refraction on entering and leaving the body). In this case 
the relation (63.22) must be integrated over the entire surface of the body; 
the functions A(a, 6) and J(m, 6) now depend not only on the material of the 
body but also on its shape and on the point considered on its surface. 
Finally, when there is scattering with change of frequency (fluorescence), 
Kirchhoff’s law applies only to the integrals over both direction and fre- 
quency of the radiation: 


ff 7(@, 8) do dw = ¢ ff ew) A(@, 6) cos 8 dodo. (63.23) 


A body which completely absorbs all radiation incident on it is called a 
black body.t For such a body, A(w, 6) = 1 by definition, and its emissive 


t Such a body can be realised in the form of a cavity with highly absorbing 
internal walls and a small aperture. Any ray entering through the aperture can 
return to it and leave the cavity only after repeated reflection from the walls 
of the cavity. When the aperture is sufficiently small, therefore, the cavity will 
absorb practically all the radiation incident on the aperture, and so the surface 
of the aperture will be a black body. 
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power is entirely determined by the function 
J(w, 8) = ceo(w) cos 9, (63.24) 


which is the same for all black bodies. It may be noted that the intensity of 
emission from a black body is a very simple function of direction, being pro- 
portional to the cosine of the angle to the normal to the surface of the body. 
The total intensity of emission from a black body, Jo, is obtained by integrat- 
ing (63.24) over all frequencies and over a hemisphere: 


eo nf2 
Jo = € { eo(w) dw f 2m cos 6 sin 6 dé = cE/AV, 
0 0 


where E is given by (63.14). Thus 
Jo = oT*, (63.25) 


i.e. the total intensity of emission from a black body is proportional to the 
fourth power of its temperature. 

Finally, let us consider radiation not in thermal equilibrium, having a non- 
equilibrium spectral or directional distribution. Let e(w,n)dwdo be the 
volume density of this radiation in the frequency interval dw and with the 
direction n of the wave vector lying in the solid-angle element do. We can 
use the concept of the temperature of the radiation in each small interval of 
frequency and direction, defined as the temperature for which the density 
e(w,n) is equal to that given by Planck’s formula, i.e. e(@, n) = eo(a). 
Denoting this temperature by T,, ,, we have 


hw 


hie 1 ; 
log { i 4c? e(o, a 


Ton = (63.26) 


Let us imagine a black body emitting into a surrounding vacuum. The 
radiation is propagated freely along straight lines and will not be in thermal 
equilibrium outside the body; it is by no means isotropic, as equilibrium 
radiation must be. Since the photons are propagated in a vacuum and do not 
interact with one another, we are in a position to apply Liouville’s theorem 
rigorously to the photon distribution function in the corresponding phase 
space of coordinates and wave-vector components.’ According to this theo- 
rem, the distribution function remains constant along the phase trajectories. 
But the distribution function is, apart from a factor dependent on frequency, 
the same as the volume density of radiation of a given frequency and 


+t When considering the limiting case of geometrical optics, we can speak of 
coordinates of a photon. 
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direction, e(, n, r). Since the radiation frequency is also constant during pro- 
pagation, we have the following important result: in every solid-angle element 
where radiation is propagated (from a given point in space) the radiation 
density e(w, n, r) is equal to the density within the emitting black body, i.e. 
to the black-body radiation density e9(w). Whereas, however, for equilibrium 
radiation the density exists for all directions, here it exists only for a certain 
interval of directions. 

Defining the temperature of non-equilibrium radiation by (63.26), we can 
express the result differently by saying that the temperature T, , is equal to 
the temperature T of the emitting black body for all directions in which radia- 
tion is being propagated (at any given point in space). If the radiation temper- 
ature is defined from the density averaged over all directions, however, it is 
of course less than the temperature of the black body. 

All these consequences of Liouville’s theorem remain fully valid when 
reflecting mirrors and refracting lenses are present, provided, of course, that 
the conditions for geometrical optics to be applicable are still satisfied. By 
means of lenses or mirrors the radiation can be focused, i.e. the range of 
directions from which rays reach a given point in space can be enlarged. This 
may increase the mean radiation temperature at the point considered, but the 
foregoing discussion shows that there is no means of raising it above the tem- 
perature of the black body which emitted the radiation. 


CHAPTER VI 


SOLIDS 


§ 64. Solids at low temperatures 


Soups form another suitable topic for the application of statistical methods 
of calculating the thermodynamic quantities. A characteristic property of 
solids is that the atoms in them execute only small vibrations about certain 
equilibrium positions, the crystal lattice sites. The configuration of the lattice 
sites which corresponds to thermal equilibrium of the body is preferred, i.e. 
distinguished from all other possible distributions, and must therefore be 
regular. In other words, a solid in thermal equilibrium must be crystalline. 

According to classical mechanics, all the atoms are at rest at absolute 
zero, and the potential energy of their interaction must be a minimum in 
equilibrium. At sufficiently low temperatures, therefore, the atoms must 
always execute only small vibrations, i.e. all bodies must be solid. In reality, 
however, quantum effects may bring about exceptions to this rule. One such 
is liquid helium, the only substance which remains liquid at absolute zero 
(at pressures that are not too high); all other substances solidify well before 
quantum effects become important.* 

For a body to be solid its temperature must certainly be small in compari- 
son with the energy of interaction of the atoms (in practice, all solids melt 
or decompose at higher temperatures). From this it results that the vibrations 
of atoms in a Solid about their equilibrium positions are always small. 

As well as crystals, there exist in Nature also amorphous solids, in which 
the atoms vibrate about randomly situated points. Such bodies are thermo- 
dynamically metastable, and must ultimately become crystalline. In practice, 
however, the relaxation times are so long that amorphous bodies behave as if 
stable for an almost unlimited time. All the following calculations apply 
equally to both crystalline and amoprhous substances. The only difference is 
that, since amorphous bodies are not in equilibrium, Nernst’s theorem does 
not apply to them, and as T — 0 their entropy tends to a non-zero value. 


t Quantum effects become important when the de Broglie wavelength corre- 
sponding to the thermal motion of the atoms becomes comparable with the distan- 
ces between atoms. In liquid helium this occurs at 2-3°K. 
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Consequently, for amorphous bodies the formula (64.7) derived below for the 
entropy has to be augmented by some constant S (and the free energy by a 
corresponding term —7So); we shall omit this unimportant constant, which, 
in particular, does not affect the specific heats of a body. 

The residual entropy, which does not vanish as T - 0, may also be ob- 
served in crystalline solids, because of what is called ordering of crystals. 
If the number of crystal lattice sites at which atoms of a given kind can be 
situated is equal to the number of such atoms, there will be one atom near 
each site; that is, the probability of finding an atom (of the kind in question) 
in the neighbourhood of each site is equal to unity. Such crystals are said to 
be completely ordered. There are also, however, crystals in which the atoms 
may be not only at their “own” positions (i.e. those which they occupy in 
complete ordering) but also at certain “other” positions. In that case the 
number of sites that may be occupied by an atom of the given kind is 
greater than the number of such atoms, and the probability of finding atoms 
of this kind at either the old or the new sites will not be unity. 

For example, solid carbon monoxide is a molecular crystal, in which the 
CO molecule can have two opposite orientations differing by interchange of 
the two atoms; the number of sites that may be occupied by carbon (or oxy- 
gen) atoms is here equal to twice the number of these atoms. 

In a state of complete thermodynamic equilibrium at absolute zero, any 
crystal must be completely ordered, and the atoms of each kind must occupy 
entirely definite positions.‘ However, because the processes of lattice re- 
arrangement are Slow, especially at low temperatures, a crystal which is incom- 
pletely ordered at a high temperature may in practice remain so even at very 
low temperatures. This “freezing” of the disorder leads to the existence of a 
constant residual term in the entropy of the crystal. For instance, in the ex- 
ample of the CO crystal mentioned above, if the CO molecules have the two 
orientations with equal probability, the residual entropy will be So = log 2. 

Let N be the number of unit cells in the lattice, and v the number of 
atoms in each cell. Then the number of atoms is Nv. Of the total number of 
degrees of freedom 3N», three correspond to translational and three to rota- 


t Strictly speaking, this statement also is valid only if quantum effects are neg- 
lected. The latter may become important (at 7 = 0) if the amplitude of the zero- 
point vibrations of the atoms in the lattice is comparable with the interatomic 
distances. In a “quantum crystal” of this kind, a situation is in principle possible 
where the number of sites in the ground state (the state at T = 0) exceeds the num- 
ber of atoms. The “zero-point” defects (free vacancies) then present in the lattice 
are, however, not localised at any particular sites as they would be in a “classical” 
crystal, and represent a collective property of the lattice without disturbing its 
strict periodicity; see A. F. Andreev and I. M. Lifshits, Soviet Physics JETP 29, 
1107, 1969, 
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tional motion of the body as a whole. The number of vibrational degrees of 
freedom is therefore 3Nv—6, but since 3Nv is extremely large we can, of 
course, neglect 6 and assume that the number of vibrational degrees of free- 
dom is just 3N». 

It should be emphasised that in discussing solids we shall entirely ignore 
the “internal” (electronic) degrees of freedom of the atoms. Hence, if these 
degrees of freedom are important (as they may be, for example, in metals), 
the following formulae will relate only to the lattice part of the thermo- 
dynamic quantities for the solid, which is due to the vibrations of the atoms. 
In order to obtain the total values of these quantities, the electronic part 
must be added to the lattice part. 

In mechanical terms, a system with 3Nv vibrational degrees of freedom 
may be regarded as an assembly of 3Nv independent oscillators, each corre- 
sponding to one normal mode of vibration. The thermodynamic quantities 
relating to one vibrational degree of freedom have already been calcuated 
in § 49. From the formulae there we can immediately write down the free 
energy of the solid as’ 


F = Neg+T ¥ log (1—e-*#u7), (64.1) 


The summation is over all 3Nv normal vibrations, which are labelled by the 
suffix «.t We have added to the sum over vibrations a term Neo which repre- 
sents the energy of all the atoms in the body in their equilibrium positions 
(more precisely, when executing their “zero-point” vibrations); this term 
depends on the density, but not on the temperature: &9 = e0(N/V). 

Let us now consider the limiting case of low temperatures. For small T, 
only the terms with low frequencies (Aw, ~ T) are of importance in the sum 
over a. But vibrations with low frequencies are just ordinary sound waves, 
whose wavelength is related to the frequency by 4 ~ u/w, where u is the 
velocity of sound. In sound waves the wavelength is large in comparison 
with the lattice constant (A >> a), and so w « u/a. In other words, the vibra- 
tions can be regarded as sound waves at temperatures such that 


T <« hu/a. (64.2) 


Let us assume that the body is isotropic (an amorphous solid). In an iso- 
tropic solid, longitudinal sound waves can be propagated (with velocity u,) 
and so can transverse waves with two independent directions of polarisation 
and equal velocities of propagation u,; see Elasticity, § 22. The frequency 


t Quantised vibrations were first used by Einstein (1907) to calculate the ther- 
modynamic quantities for a solid. 


+t See (71.7) for an integral form of this. 
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of these waves is linearly related to the absolute magnitude of the wave 
vector k by m = u,k or m = u,k. 

The number of vibrational modes in the spectrum of sound waves with 
absolute magnitude of the wave vector lying in the interval dk and with a 
given polarisation is V-42k? dk/(2x)°, where V is the volume of the body. 
Putting for one of the three independent polarisations k = w/u, and for the 
other two k = w/u,, we find that the interval dw contains altogether 


wdwo/1 2 
yoo (ata) (64.3) 
vibrations. 
A mean velocity of sound % can be defined according to the formula 
JO ea ieh ee 
we wh oP 


Then the expression (64.3) becomes 
V +300? de /2748 . (64.4) 


In this form it is applicable not only to isotropic bodies but also to crystals, 
where % = u(V/N) must be understood as the velocity of propagation of 
sound in the crystal, averaged in a certain way. The determination of the 
averaging procedure requires the solution of the problem (which belongs 
to the theory of elasticity) of the propagation of sound in a crystal of given 
symmetry.t 

By means of (64.4) we can change from the summation in (64.1) to integra- 
tion, obtaining 


am 3V — e—fiw 2 ° 
PS Neo+T aaa ( log a é Tay da; (64.5) 


¢ 
0 


because of the rapid convergence of the integral when T is small, the integra- 
tion can be taken from 0 to oo. This expression (apart from the term Neo) 
differs from the formula (63.10) for the free energy of black-body radiation 
only in that the velocity of light c is replaced by the velocity of sound % and 
a factor appears. This resemblance is not surprising, since the frequency 
of sound vibrations is related to their wave number by the same type of linear 
formula as is valid for photons. Theintegers v, in the energy levels Sv,iw, of 
a system of sound oscillators may be regarded as occupation numbers of the 


t In an anisotropic medium, there are in general three different branches of the 
sound-wave spectrum, in each of which the velocity of propagation is dependent 
on direction (see Elasticity, § 23). 
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various quantum states with energies ¢, = fw,, the values of these numbers 
being arbitrary (as in Bose statistics). The appearance of the extra factor 
+ in (64.5) is due to the fact that sound vibrations have three possible 
directions of polarisation instead of two as for photons. 

Thus, without having to repeat the calculations, we can use the expression 
(63.11) derived in § 63 for the free energy of black-body radiation, if c is 
replaced by @ and a factor 3 included. The free energy of a solid is therefore 


F = Neg—V -n?T4/30(hu)’; (64.6) 
the entropy is 

S = V-20°T3/15(huy? , (64.7) 
the energy 

E = Negt+V-7PT4/10(hi) , (64.8) 
and the specific heat 

C = 2°TV /[S(huy. (64.9) 


Thus the specific heat of a solid at low temperatures is proportional to the 
cube of the temperature’ (P. Debye, 1912). We write the specific heat as C 
simply (not distinguishing C,, and C,), since at low temperatures the differ- 
ence C,—C, is a quantity of a higher order of smallness than the specific 
heat itself (see § 23; here S oc T* and so C,—C,, oc T”). 

For solids having a simple crystal lattice (elements and simple compounds) 
the 7° law for the specific heat does in fact begin to hold at temperatures of 
the order of tens of degrees, but for bodies with a complex lattice this law 
may be expected to be satisfactorily obeyed only at much lower tempera- 
tures. 


§ 65. Solids at high temperatures 


Let us now turn to the opposite limiting case of high temperatures (of 
order T > hu/a, where a is the lattice constant). In this case we can put 
1—e~*eT ~ fey /T, and formula (64.1) becomes 


F = Neo+T ¥ log (fa,/T). (65.1) 


The sum over « contains altogether 3Nv terms. We define the “geometric 
mean” frequency © by 


= 1 
log & = Sie 2 log a@,. (65.2) 


t It may be recalled that when “electronic degrees of freedom” are present 
these formulae give only the lattice part of the thermodynamic quantities. However, 
even when there is an electronic part (as in metals) it begins to affect the specific 
heat, for example. only at temperatures of a few degrees. 
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Then the free energy of the solid is given by 
F = Neg—3N1T log T+ 3N1T log ho. (65.3) 


The mean frequency &, like i, is a function of the density, @(V/N). 
From (65.3) we find the energy of the body, E = F—T oF/07: 


E = Néo+3N0T. (65.4) 


The case of high temperatures corresponds to the classical treatment of 
the vibrations of the atoms; it is therefore clear why formula (65.4) accords 
exactly with the law of equipartition (§ 44): apart from the constant Néo, an 
energy T corresponds to each of the 3N» vibrational degrees of freedom. 

For the specific heat we have 


C = Ne = 3N), (65.5) 


where c = 3» is the specific heat per cell. We again write the specific heat 
as C simply, since in solids the difference between C, and C, is always negli- 
gible (see the end of § 67). 

Thus at sufficiently high temperatures the specific heat of a solid is constant 
and depends only on the number of atoms in the body. In particular, the 
specific heat per atom must be the same for different elements with a 
simple crystal lattice (vy = 1) and equal to 3; this is Dulong and Petit’s law. 
At ordinary temperatures this law is well satisfied for many elements. 
Formula (65.5) is valid at high temperatures for a number of simple com- 
pounds also, but for more complex compounds it gives a limiting value 
of the specific heat which in general is not reached before the substance 
melts or decomposes. 

Substituting (65.5) in (65.3) and (65.4), we can write the free energy and 
energy of a solid as 


F = Neo—NcT log T+NcT log ha, (65.6) 

E = Neo+NcT. (65.7) 
The entropy S = —0F/0T is 

S = Ne log T—Nc log (h@/e). (65.8) 


Formula (65.1) can also, of course, be derived directly from classical 
statistics, using the general formula (31.5) 


F = —Tlog f e~Fu.om dW. (65.9) 


For a solid, the integration over the coordinates in this integral is carried out 
as follows. Each atom is regarded as being situated near a particular lattice 
site, and the integration over its coordinates is taken only over a small 
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neighbourhood of that site. It is clear that all the points in the region of 
integration thus defined will correspond to physically different microstates, 
and no additional factor is needed in the integral.* 

We substitute in (65.9) the energy expressed in terms of the coordinates 
and momenta of the normal modes: 


E(p, 9) = + Y (pi + w2q2), (65.10) 


and write dJ" in the form 


1 
dl = Gann LI dpa du. . 


Then the integral becomes a product of 3N?v integrals, all of the form 


eo 


ff exp {—(p2 +. 03g2)/2T} dp, dq. = 2aT/o,, 
leading to formula (65.1); because of the rapid convergence of the integral, 
the integration over q, may be extended from — © to o. 

At sufficiently high temperatures (provided that the solid does not melt or 
decompose) the effects of anharmonic vibrations of the atoms may become 
appreciable. The nature of these effects as regards the thermodynamic quanti- 
ties for the body may be investigated as follows; cf. the similar calculations 
for gases in§ 49. Taking into account the terms following the quadratic terms 
in the expansion of the potential energy of the vibrations in powers of q,, 
we have 


E(p, g) = folp. D+SAD+AM + ---> 


where /2(p, q) denotes the harmonic expression (65.10) (a quadratic form in 
q,, and p,), and f(g), f(g), -.. are forms homogeneous in all the coordinates 
g., of degree three, four, etc. Substituting in the partition function in (65.9) 
Iu = %, VT, P, = P, VT, we obtain 


Fs f e-E. MIT ql 
= T3* [" exp (-falp’, 1) —VTf(q)-Th)— -.-} oT. 


We see that, when the integrand is expanded in powers of the temperature, 
all odd powers of »/T are multiplied by odd functions of the coordinates, 
which give zero on integration over the coordinates. Hence Z is a series 
Z = Zo+TZi1+T?Z2+... which contains only integral powers of the 


+ Whereas it was for a gas, where the integration over the coordinates of each 
particle was taken over the whole volume (cf. the end of § 31). 
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temperature. On substitution in (65.9), the first correction term to the free 
energy will accordingly be of the form 


Fron = AT?, (65.11) 


i.e. proportional to the square of the temperature. In the specific heat it gives 
acorrection’ proportional to the temperature itself. It should be emphasised 
that the expansion under discussion here is essentially one in powers of the 
ratio T/é9, which is always small, and not, of course, in powers of the ratio 
T/h@, which in the present case is large. 


PROBLEMS 
PROBLEM 1. Determine the maximum work which can be obtained from two 


identical solid bodies at temperatures 7, and 7, when their temperatures are made 
equal. 


SOLUTION. The solution is similar to that in § 43, Problem 12, and gives 
| R lax = Ne(\/T, —V/To’. 


PROBLEM 2, Determine the maximum work which can be obtained from a solid 
when it is cooled from a temperature T to the temperature 7, of the medium (at 
constant volume). 


SOLUTION. From formula (20.3) we have 
|Rimax = Ne(T—T)+NeT, log (T/T). 


§ 66. Debye’s interpolation formula 


Thus in both the limiting cases of low and high temperatures it is possible 
to make a sufficiently complete calculation of the thermodynamic quantities 
for a solid. In the intermediate temperature range, such a calculation in a 
general form is impossible, since the sum over frequencies in (64.1) depends 
considerably on the actual frequency distribution over the whole spectrum 
of vibrations of the body concerned. 

It is therefore of interest to construct a single interpolation formula giving 
the correct values of the thermodynamic quantities in the two limiting cases. 
More than one such formula can be found, of course, but we should expect 
that a reasonable interpolation formula will give at least a qualitatively cor- 
rect description of the behaviour of the body throughout the intermediate 
range. 

The form of the thermodynamic quantities for a solid at low temperatures 
is given by the distribution (64.4) of the frequencies in the vibration spectrum. 


t This correction is usually negative (corresponding to positive A in (65.11). 


§ 66 Debye’s Interpolation Formula 199 


At high temperatures it is important that all the 3N» vibrations are excited. 
To construct the required interpolation formula, therefore, it is reasonable to 
start from a model in which the law (64.4) (which in reality is valid only at 
low frequencies) governs the frequency distribution over the whole vibration 
spectrum, the spectrum beginning at = 0 and terminating at some finite 
frequency w,, determined by the condition that the total number of vibra- 
tions is equal to the correct value 3Nv: 


Dey 3 
ala ae Vom _ 3Nv, 


275 2048 


whence 
Om = U(6r?Nv/VY*, (66.1) 


Thus the frequency distribution in this model is given by the formula 


ONrw? dw/w3, (w <0.) (66.2) 


for the number of vibrations with frequencies in the interval de (here i has 
been expressed in terms of o,,). 
Changing from the sum in (64.1) to an integral, we now have 


F=Neo+T:- ral w? log (1—e~*#/T) do. 
m 
0 


The Debye temperature or characteristic temperature © of the body is defined 
by 
O = ho, (66.3) 


(and is, of course, dependent on the density of the body). Then 
O/T 
F = Neo+9NvT(T/OP { 27 log (1 —e-*) dz. (66.4) 
0 


Integrating by parts and using the Debye function 


x 


D(x) = | or (66.5) 


we can rewrite this formula as 


F = Neo+-NvT[3 log (1—e-9/T)— D(O/T)}. (66.6) 
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Hence the energy E = F—TOF/OT is 
E = Neot3NoTD(O/T) (66.7) 
and the specific heat is 
C = 3N{D(O/T)—(0/T) D’(O/T)}. (66.8) 
Figure 8 shows a graph of C/3Nv as a function of T/O. 
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Formulae (66.6) — (66.8) are the required interpolation formulae for the 
thermodynamic quantities for a solid (P. Debye, 1912). 

It is easy to see that in both limiting cases these formulae in fact give the 
correct results. For T< © (low temperatures) the argument 9/T of the 
Debye function is large. In a first approximation we can replace x by in 
the upper limit of the integral in the definition (66.5) of D(x); the resulting 
definite integral is 24/15, and sot 


D(x) = 24/5x8 (x > 1). 
Substituting this in (66.8), we obtain 
C = (12Nrn*/5) (T/0)*, (66.9) 


which is the same as (64.9). At high temperatures (7 >> ©) the argument of 
the Debye function is small; for x < 1 we have D(x) = 1 to a first approx- 


t Replacing { by f J , expanding (e7— 1)~ in the second integrand in pow- 


ers of e7?, aad intesiating term by term, we find that, for x >> 1, 
es 
D(x) = =3 —3e7*{1+ O11 /x)}. 


The value given in the text is therefore correct to within exponentially small terms. 
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imation,’ and (66.8) gives C = 3N», again in full agreement with the previous 
result (65.5).+ 

It is useful to point out that the actual form of the function D(x) is such 
that the criterion of applicability of the limiting expressions for the specific 
heat is the relative magnitude of T and 40: the specific heat may be regarded 
as constant for T >> 40 and proportional to T? for T< 40 

According to Debye’s formula, the specific heat is some universal function 
of the ratio O/T. In other words, according to this formula, the specific 
heats of bodies must be the same if the bodies are in corresponding states, 
i.e. have the same value of 0/T. 

Debye’s formula gives a good description of the variation of specific heat 
with temperature (as far as can be expected from an interpolation formula) 
only for certain substances with simple crystal lattices: most of the elements, 
and some simple compounds such as the halides. It is inapplicable in prac- 
tice to substances of more complex structure; this is quite reasonable, since 
in such substances the vibration spectrum is extremely complicated. 


§ 67. Thermal expansion of solids 


The term proportional to T* in the free energy at low temperatures (64.6) 
can be regarded as a small correction to Fy = Neo(V/N). The small correction 
to the free energy (for given V and T’) is equal to the small correction to the 
thermodynamic potential ® (for given P and T; see (15.12)). We can therefore 
write immediately 


® = Oo P)—2T*V (P)/30(Hi) (67.1) 


Here ®o(P) is the temperature-independent part of the thermodynamic poten- 
tial, Vo(P) the volume expressed as a function of pressure by means of the 
relations P = —OFo/OV = —Ndeo/dV, and & = u(P) is the mean velocity of 
sound, expressed in terms of the pressure by means of the same relations. The 


t For x < 1 a direct expansion of the integrand in powers of x and integration 
term by term gives 


Dix)= 1-3xt+He—-... . 
t The specific heat at high temperatures accurate to the next term in the expan- 
sion is 
C = 3Nr{1—Z,(0/T. 
§ As examples, the values of 9 for a number of substances, derived from their 


specific-heat values, are Pb 90°, Ag 210°, Al 400°, KBr 180°, NaC! 280°. For dia- 
mond, @ is particularly large, ~ 2000°. 
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dependence of the volume of the body on the temperature is given by V= 
OO/OP: 


nT 4 d Vo 
V = VAP) sos a5 (ze >); (67.2) 
The thermal expansion coefficient is 
<= ac, 27°T? d Vo 
@ = (IV) (VT )» = — Tae (=: (67.3) 


We see that at low temperatures « is proportional to the cube of the temper- 
ature. This result is already obvious from Nernst’s theorem (§ 23) together 
with the T* law for the specific heat. 

Similarly, at high temperatures we can consider the second and third terms 
in (65.6) as small corrections to the first term (we must always have T < eo 
if the body is solid), and obtain 


® = @(P)—Nc log T+.NcT log ha(P), (67.4) 


whence 
V = Vo(P)-+(NcT/&) d@/dP. (67.5) 


The thermal expansion coefficient is 
a = (Nce/Vo) da/dP, (67.6) 


and is independent of the temperature. 

When the pressure increases, the atoms in a solid come closer together, 
and the amplitude of their vibrations (for a given energy) decreases, i.e. the 
frequency increases. Thus d@/dP > 0, so that « > 0, and solids expand 
when the temperature rises. Similar considerations show that the coefficient 
a given by formula (67.3) is also positive. 

Finally, we can make use of the law of corresponding states given at the 
end of § 66. The statement that the specific heat is a function only of the ratio 
T/O is equivalent to saying that the thermodynamic potential, for example, is 
of the form 

® = Oo P)+Of(T/O). (67.7) 


The volume is 


V = Vo(P)+(dO/dP) [f—(T/O) 1, 
and the thermal expansion coefficient is 
a = —(T/V.O0?)(dO/dP)f”’ . 


Similarly, we find the heat function W = ©—T0@/eT and the specific heat 
C= ow/eT: 
= —(T/O)f”. 
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Taking the ratio of the two expressions for « and C, we obtain 


(67.8) 


Thus, within the limits of validity of the law of corresponding states, the 
ratio of the thermal expansion coefficient to the specific heat of a solid is 
independent of temperature (Griineisen’s law). 

It has already been mentioned that in solids the difference between the 
specific heats C, and C, is very slight. At low temperatures this is a general 
consequence of Nernst’s theorem, which applies to all bodies. At high 
temperatures we have, using the thermodynamic relation (16.9), 


C=C, ==7F 


where « = a(P) is the thermal expansion coefficient (67.6). We see that the 
difference C,—C, is proportional to T; essentially this means that its expan- 
sion in powers of T/eo begins with a first-order term, whereas that of the 
specific heat itself begins with a zero-order (constant) term. Hence it follows 
that in solids C,—C, « C at high temperatures also. 


§ 68. Highly anisotropic crystals 


It has been noted at the end of § 66 that Debye’s formula is inapplicable in 
practice to crystals of complex structure. These include, in particular, highly 
anisotropic crystal structures of the “layer” and “chain” types. The former 
may be described as consisting of parallel layers of atoms, the interaction 
energy of the atoms within each layer being large in comparison with the 
coupling energy of adjacent layers. Similarly, the chain structures consist 
of parallel chains of atoms that are relatively weakly coupled together. The 
acoustic vibration spectrum of such crystals has not one but several Debye 
temperatures with different orders of magnitude. The T° law for the specific 
heat then applies only at temperatures that are low compared with the lowest 
Debye temperature; in the intermediate regions, different limiting laws 
occur (I. M. Lifshitz, 1952). 

Let us first consider layer structures. Such a lattice is most rigid with re- 
gard to vibrations of atoms in the plane of the layers, which we take as the 
xy-plane; the rigidities for vibrations of one layer as a whole relative to the 
others are relatively very small. These properties bring about a dependence 
of the frequency on the wave vector (the dispersion relation) in three branches 
of the acoustic wave spectrum, as shown by the following formulae (which 
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are written here for the case of a crystal with hexagonal symmetry): 
= UE+ HR, of = UbO +d, 


(68.1) 
oe = bx®+uik2, (0? = k2+k), 


with U1, Us >> us, us. Here the velocities of propagation U1, Us refer to 
vibrations of atoms in the plane of the layers, us (in the branches w1 and we) 
to shear vibrations of layers with respect to one another, and 1 to vibrations 
of the relative distance between the layers.' 

The expressions (68.1) are, however, insufficient for studying the thermal 
properties of the crystal. They are in fact only the first terms in the expansion 
of the functions w*(k) in powers of the wave vector. In view of the “anoma- 
lous” smallness of some of the coefficients in the quadratic terms in such 
expansions, terms of the next (fourth) order begin to be important.t To 
ascertain their form, we note that, when the coupling between layers is 
entirely neglected, the dispersion relations of the waves become 


= Ui’, wi = Ux, wi = yx. (68.2) 


The frequencies w and w2 correspond to longitudinal vibrations in the plane 
of the layers, and ws to transverse vibrations, which in this case are bending 
waves of the layers, regarded as free elastic thin plates (cf. Elasticity, § 25). 
Hence, neglecting the small fourth-order terms that depend on the coupling 
between the layers, we can finally write the wave dispersion relation as 


OF 9 = Ute? +18k2, wh = Bx? + uk? + yt. (68.3) 


We shall assume that U1 ~ Us, us ~ ua, and use the notation 7 ~ u/U for 
the small ratio which represents the coupling energy between the layers 
relative to that between the atoms in one layer. We shall also use the Debye 
temperature, or more precisely the highest of the Debye temperatures, 


t The assumption of hexagonal symmetry of the crystal has no fundamental 
significance, and is made only in order to give formulae (68.1) a more definite 
form. The velocities U,, ..., ug, are expressed in terms of the elastic moduli Ajxim 
of such a crystal by 


U7 = | Q. UZ = Axxxx/0, ug a peer | Q, us = A esel Q. 


where ¢ is the density; these can be derived from the expressions in Elasticity, 
§ 23, Problem, by expanding in powers of the moduli /,,,, and A,,,,, which for a 
layer crystal are small in comparison with 1... and Axy,y. The type of vibration 
mentioned in the text is evident from the sense of the various components Ajpin- 


t The equation giving the dispersion relation of waves is an algebraic equation 
for w? (see § 69). It is therefore the function c*(k) that has a regular expansion in 
powers of k,, k,, k,. Since this function is even (see § 69), the expansion contains 
only even powers. 
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O = fw,,, wherew,, ~ U/a is the limiting frequency of the “hard” vibrations 
(a being the lattice constant); the limiting frequency of the “soft” vibrations 
is small in comparison with @,,, in the ratio 7. Lastly, it is reasonable to 
suppose that the limiting frequency of the bending waves is of the same order 
as of less than ,,; let it be ~ ,,.' Under these conditions, let us find the 
nature of the temperature dependence of the specific heat of the crystal 
when T <« 9.¢ 

With inclusion of the contribution from acoustic vibrations, the free 


energy of the body is given by 


V dk, dk, dk, 


=a? (68.4) 


3 
F = NeotT ¥, (log (1—e-") 
axl 


where the summation is taken over the three branches of the spectrum, and 
the integration over the whole range of variation of the wave vector. 

If T >> 7@, we can neglect the coupling between layers, and therefore use 
the spectrum (68.2). The main contribution to the free energy comes from 
the “bending” branch ws. Because of the rapid convergence when T « 9, 
the integration over k, and k, can be extended from — © to oo. Replacing 
it by integration over 22x dx, we find by an obvious substitution 


oo eo 


| log (1 —e-*”"!T)2nx dx = - ( log (1 —e-*) dx. 
5 


0 

The integration over k,, with range |k,| <k, ma, ~ 1/a, gives a tempera- 

ture-independent factor ~ 1/a. The result is that the temperature-dependent 

part of the free energy is proportional to 7? and correspondingly, for the 
specific heat, 

CoT for fO«T«@. (68.5) 


If T < nO, we must use the full expressions (68.3) for w,(k) in the integrals 
(68.4), and the integration over each component of k can be extended from 
— co to co. The resulting temperature dependence of the free energy is fairly 
complicated, but two further limiting cases may be distinguished. If T >> 770, 
the main contribution again comes from the branch ws, and the term in x? 
may be omitted, leaving 


a = Et Pt 


t That is, we take y ~ w,,a7 ~ Ua. It must be emphasised that the coefficient y, 
which is related to the “transverse rigidity” of the layers, is not expressible in terms 
of the elastic moduli A,x;,, alone. 

t High temperatures J >> © form the classical range, in which the specific heat 
C is constant. 

§ That is, over one reciprocal lattice cell; see (71.7). 
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This follows since the most important range in the integral over x dx is that 
where fiyx? ~ T, for which hux'~ hu(T/hy!” ~ T(?O/T)? « T. We then 
have 


{ og [1 -exp -# Vv Catt 7*9] ‘enldy dk. 
Ed 


= constant XT?/usy, 
and so for the specific heat 
Col? for rO«T «7. (68.6) 


Lastly, when T < 7?@ we can show by the same method that the term in 
x4 can be omitted from (68.3), and we then return to the acoustic spectrum 
(68.1) with w linearly dependent on k and the specific heat given by Debye’s 
law 


CaT®? for T«7?@. (68.7) 


Crystals with a chain structure can be treated similarly; we shall take the 
direction of the chains as the z-axis. In this case, the dispersion relations in 
the three branches of the acoustic-wave spectrum are 


OF 9 = Ub ox? + ubk2+y7k4, wd = ube? + U3K?, (68.8) 


where now 11, U2, us « U,.' If the interaction between the chains is neg- 
lected, the relations (68.8) become 


oh = 7, ah = UWE: 


the branch ws corresponds to longitudinal vibrations of the atoms in the 
chains, and the branches w and wg to bending waves in the chains, regarded 
as elastic strings: If we assume uw. ~ we ~ us and again use the small para- 
meter 7 ~ u/U and the Debye temperature © ~ AU/a, we can derive the 
following limiting forms for the temperature dependence of the specific 
heat: 


Cat” for nMO«T«0O, 
Ceo TS? for 7?O«T «70, (68.9) 
CaT*® for T «70. 


* Here we again take the particular case of hexagonal symmetry, in this in- 
stance about the direction of the chains The velocities u,, ..., U, are expressed in 
terms of the elastic moduli by the same formulae as in the first footnote to this 
section, but Ayxxx+ Axyxy and A,2x2 are now small in comparison with A,,,,- 
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§ 69. Crystal lattice vibrations 


In the preceding sections, we have considered the thermal motion of atoms 
in a solid as a set of normal modes of small vibrations of the crystal lattice. 
Let us now consider more closely the mechanical Se of these vibra- 
tions. 

Each unit cell of the crystal generally contains several atoms. It is therefore 
necessary, to specify cach atom by stating which unit cell contains it and 
giving the number of the atom in that cell. The position of the unit cell can be 
defined by the radius vector r, of any particular vertex of it; this takes values 
given by 


Ty, = ya, +NeAe+ngas, (69.1) 


where 71, M2, Ns are integers and aj, ae, ag the basis lattice vectors corre- 
sponding to the edge lengths of the unit cell. 

We shall denote the displacements of the atoms in their vibrations by u,, 
where the suffix s is the number of the atom in the cell (s = 1, 2, 
with » the total number of atoms per cell). The Lagrangian of a crystal pine 
as a mechanical system of particles executing small vibrations about their 
equilibrium positions (lattice sites) is 


= FY mak) 4 FAW O—w ula) uyelw)- (69.2) 


ae 


where the “vector” n = (7, Mg, M3), m, are the masses of the atoms, and i, k 
are vector suffixes taking the values x, y, z; summation over repeated suffixes 
is, as usual, implied. The coefficients A depend only on the differences n—n’, 
since the interaction_forces between atoms can depend only on the relative 
Position_of of | the lattice cells, not on their absolute position in space. These 
coefficients are symmetrical: 


ie (n) = Agi(—n), (69.3) 


as is evident from the form of the function (69.2). 
The equations of motion 


Mil, = — >» Af (a—n’) uy,(a’) (69.4) 


follow from the Lagrangian (69.2). Note that the coefficients A satisfy certain 
relations which express the fact that a parallel displacement or a rotation of 
the lattice as a whole gives rise to no forces on the atoms. For a parallel 
displacement, all the u,(n) are constant, and therefore 


£ 4¢@) = (69.5) 


15 
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We shall not pause to write out the relations that follow from the invariance 
under rotations. 
We shall seek solutions of the equations (69.4) in the form of a monochro- 


matic plane wave 
u,(n) = e,(k) exp [i(ker,—/)]. (69.6) 


The (complex) amplitude e, depends only on the suffix s, i.e. differs only for 
different atoms in the same cell, but not for equivalent atoms in different 
cells. The vectors e, determine both the vibration amplitude and the direc- 
tion of polarisation of the vibrations. 

Substitution of (69.6) in (69.4) gives 


w*m,e,, exp (iker,) = >» AS (n—n’)ey, exp (iker,,). 


On dividing each side by exp (iker,) and replacing the summation over n’ by 
one over n’—n, we find 


Y AB (eer — omen = 0, (69.7) 


where 
At(k) = >) Aig(n) exp (— iker,). (69.8) 


The set of linear homogeneous algebraic equations (69.7) for the amplitudes 
has non-zero solutions if the consistency equation 


det | 45k) —w?m,6,6,,-| = 0 (69.9) 


is satisfied. Since the suffixes i and k take 3 values each, and s and s’ v values 
each, the order of the determinant is 3», so that (69.9) is an algebraic equa- 
tion of degree 3y in w?. 

Each of the 3y solutions of this equation determines the frequency w as a 
function of the wave vector k; this is called the dispersion relation of the 
waves, and equation (69.9), which determines it, is called the dispersion equa- 
tion. Thus for any given value of the wave vector the frequency can in gen- 
eral take 3 different values. We can say that the frequency is a many-valued 
function of the wave vector, with 3» branches: w = @.(k), where the suffix « 
labels the branches of the function. 

From the definition (69.8) and the equations (69.3), it follows that 


Ak) = Ati(—k) 
= [AG(KI". (69.10) 
Thus the quantities Af{(k) form an Hermitian matrix, and the solution of 


equations (69.7) is mathematically one of determining the eigenvalues and 
corresponding “eigenvectors” of such a matrix. According to the known 
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properties of Hermitian matrices, the eigenvectors corresponding to differ- 
ent eigenvalues are orthogonal. In the present case, this means that 


YS maPeue*=0 for «#a’, (69.11) 
gl 


where the superscript («) to the displacement vector indicates the branch of 
the vibration spectrum to which it belongs. The equations (69.11) express 
the orthogonality of the polarisations in different branches of the spectrum. 

Owing to the symmetry of the mechanical equations of motion under 
time reversal, if the propagation of a wave (69.6) is possible, then so is that 
of a similar wave in the opposite direction. This change of direction is equiv- 
alent to a change in the sign of k. The function w(k) must therefore be even: 


co(—k) = o(k). (69.12) 


The wave vector of the lattice vibrations has the following important 
property. The vector k appears in the expression (69.6) only through the 
exponential factor exp (iker,). But this factor is unchanged when k is replaced 


according to 
k->+k+b, b= pibi+pobe +psbs, (69.13) 


where b is any vector of the reciprocal lattice, bi, be, bs are the basic vectors 
of that lattice, and pj, pe, ps are integers. This means that the wave vector 
of the lattice vibrations is physically indeterminate: values of k differing by 
b are physically equivalent. The function w(k) is periodic in the reciprocal 


lattice: 
w(k +b) = w(k), 


and in each branch it is therefore sufficient to consider values of the vector k 
that lie in a certain finite range, a single cell of the reciprocal lattice. If the 
coordinate axes (in the general case, oblique) are along the three basic vectors 
of the reciprocal lattice, we can take, for example, just the range 


—$b; < k, < $b;. (69.14) 


When k takes values in this range, the frequency w(k) in each branch of 
the spectrum takes values that occupy a band or zone of finite width. Differ- 
ent zones may, of course, partly overlap. 

In geometrical terms, the functional! relation w = w(k) is represented by a 
four-dimensional hypersurface, different sheets of which correspond to 
the different branches of the function. These sheets may be not completely 


t The occurrence of the “weighting” factor m, in the relations (69.11) is duz to 
the fact that the w?2 are eigenvalues not of the matrix A‘'(k) itself, but of 
sc l4/(myny), and the corresponding eigenvectors are «/mu®. 
t The concepts used here are considered in detail in § 133. 


is* 


210 Solids 


separate, i.e. they may intersect. The possible types of such intersections 
depend considerably on the specific symmetry of the crystal lattice. The 
study of this topic involves the application of group theory (see § 136). 

Among the 3» branches of the vibration spectrum, there must be some 
which for wavelengths large compared with the lattice constant correspond 
to ordinary elastic (i.e. sound) waves in the crystal. It is known (see Elasticity, 
§ 23) that waves of three types with different dispersion relations can be 
propagated in a crystal regarded as a continuous medium, and for all three 
types w is a homogeneous function of the first order in the components of the 
vector k, and vanishes when k = 0. Thus the 3» branches of the function 
«(k) must include three for which the dispersion relation when k is small 


has the form 
w = kf(k/k). (69.15) 


These three types are called acoustic waves; they are characterised by the 
fact that (when k is small) the lattice vibrates as a whole, as a continuous 
medium. In the limit as k ~ 0, these vibrations become a simple parallel 
displacement of the entire lattice. 

In complex lattices containing more than one atom per cell, there are a 
further 3(v—1) types of wave. In these branches of the spectrum, the fre- 
quency does not vanish when k = 0, but tends to a constant limit as k — 0. 
These vibrations of the lattice are called optical vibrations. In this case the 
atoms in each unit cell are in relative motion, and in the limit k = 0 the 
centre of mass of the cell remains fixed.t 

The 3v—3 limiting frequencies (at k = 0) of the optical vibrations need 
not be all different. When the crystal has certain symmetry properties, the 
limiting frequencies of some of the optical branches of the spectrum 
may coincide or be degenerate (see § 136). 

The function w(k) with a non-degenerate limiting frequency can beexpand- 
ed (near k = 0) in powers of the components of the vector k. Since the 
function w(k) is even, this expansion can contain only even powers of k,, 
and its leading terms are therefore 


© = Wo+sVukike, (69.16) 


where Wp is the limiting frequency and the y, are constants. 


t This last fact can be formally deduced directly from the equations of motion 
(69.7), (69.8). For k = 0, these become 


2, A Mere = m,we,. 
a, 


Summing both sides over s, we have zero on the left, by (69.5), and so we must 
also have 2m,e, = 0 if the equations are consistent when k = 0. 
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If, however, the limiting frequencies of several branches coincide, the 
functions w(k) for these branches cannot be expanded in powers of k, 
since the point k = 0 is a singular one (a branch point) for them. We can 
say only that, near k = 0, the difference w—w will be a homogeneous 
function of the components of k, of either the first or the second order 
(depending on the symmetry of the crystal). 

Concerning the whole of the foregoing discussion, it must be emphasised 
once more that this has related only to the harmonic approximation, in 
which only those terms are taken into account, in the potential energy of the 
vibrating particles, which are quadratic in the displacements of the atoms. 
It is only in this approximation that the various monochromatic waves 
(69.6) do not interact but are freely propagated through the lattice. When the 
Subsequent anharmonic terms are taken into account, various processes of 
decay and scattering of these waves by one another appear. The interaction 
may also lead to the formation of “bound states” of waves (phonons; see 
below), as new branches of the spectrum that do not exist in the harmonic 
approximation. 

Moreover, it is assumed that the lattice is perfectly periodic, but it must 
be borne in mind that the perfect periodicity is to some extent perturbed, 
even without allowing for possible impurities and other lattice defects, if the 
crystal contains randomly distributed atoms of different isotopes. This 
perturbation, however, is comparatively small if the relative difference of 
atomic weights of the isotopes is small or if the abundance of one isotope 
greatly exceeds those of the others. In such cases the above description 
remains valid in a first approximation, and in subsequent approximations 
there occur various processes of scattering of waves by inhomogeneities 
in the lattice.‘ 


§ 70. Number density of vibrations 


The number of vibrations corresponding to a range d°k = dk, dk, dk, of the 
values of components of the wave vector is d°k/(2x)8 per unit volume of the 
crystal. The frequency distribution function g(w) of the vibrations, which 
gives the number g(@) dw of vibrations whose frequencies lie in a given range 
between w and w+dw, is a characteristic of the vibration spectrum of a 
particular lattice. This number is, of course, different for different branches 


t The presence of lattice defects also brings about certain changes in its vibra- 
tion spectrum, with new frequencies that correspond to “local” vibrations near 
the defects. For a study of these effects see ILM. Lifshitz and A.M. Kosevich, 
Reports on Progress in Physics 29, 217, 1966. 
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of the spectrum, but to simplify the notation we shall omit in this section the 
appropriate suffix « to the functions w(k) and g(a). 

The number g(w) dw is the volume (divided by 87°) in k-space that lies 
between two infinitesimally close surfaces of constant frequency w(k) = 
constant. At each point in k-space, the gradient of the function w(k) is along 
the normal to the constant-frequency surface through the point. It is there- 
fore clear from the expression dw = dk-v,«(k) that the distance between two 
such infinitesimally close surfaces (measured along the segment of the nor- 
mal between them) is dw/|V,@]|. Multiplying this by the area df, of the 
constant-frequency surface element and integrating over the whole surface 
within one cell of the reciprocal lattice, we find the required part of the vol- 
ume of k-space, and division by (27) gives the frequency distribution density : 

] fx 
8) = Gay | rods a 

In each zone (range of values traversed by some branch w(k) in one cell 
of the reciprocal lattice of k) the function w(k) must have at least one mini- 
mum and one maximum. Hence it follows that this function must also have 
saddle-points.t The existence of these various stationary points leads to 
certain properties of the frequency distribution function g(w) (L. Van Hove, 
1953). 

Near an extremum at some k = ko, the difference w(k)—@o, where @o = 
(Ko), has the form 


W—Wy = Fyilk,—koi) (kx—kox). 


Taking the axes in k-space along the principal axes of this quadratic form, 
we can write it as 


O—Wo >= Flyi(k = Kox)* + yolk, ae Koy)? + ya(kz rr ko;)"], (70.2) 


where ,, Ya, Ys are the principal values of the symmetrical tensor y,,. 

Let us first consider a minimum or maximum of w(k). Then 71, ye, v3 have 
the same sign. Using instead of k,, k,, k, the new variables x,, x,, %, accord- 
ing tox, = +/|7,| (K,—Ko,), etc., we have 


O—Wy = + F(x2 +22 +22) = +42, (70.3) 


In this case the constant-frequency surfaces in %-space are spheres. Changing 
to integration in #-space in (70.1), we have 


2(o) of 


= Gnyvy Tua)? y = lyryaysl. (70.4) 


T It can be shown (but we shall not pause to do so) that there must exist at 
least six saddle-points, three of each of two types, corresponding to the two signs 
in (70.8). 
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The surface element of the sphere is df, = x?do,, where do, is the element 
of solid angle. The gradient of the function (70.3) is v,w(#) = +. The 
integral in (70.4) is therefore equal to 4x; expressing x in terms of w — Wo by 
(70.3), we have finally 


g(@) = a) af “) ~~ /|@—ao). (70.5) 
Thus the vibration number density has a square-root singularity; the deriv- 
ative dg/dw becomes infinite as w — wo. 

It must be borne in mind, however, that in the general case (if w = wo 
lies within and not on the edge of the frequency band) the constant-frequency 
surfaces for values of w close to @o may include other sheets in other parts of 
the k-space cell, as well as the ellipsoids round k = ko. In the general case, 
therefore, the expression (70.5) gives only the “singular” part of the vibration 
number density, and it would be more correct to write 


= 4/|@ — 00] 

(0) = elo) + es (70.6) 
on one side of @ = wo (@ < @o for a maximum, or @ > Wo for a minimum), 
and g(w) = g(@o) on the other side. 

Note also that formula (70.5) does not, of course, relate to the neighbour- 
hood of the lower edge ( = 0) of the acoustic vibration zone, where the 
dispersion relation has the form (69.15). It is easily seen that in this case 


g(@) = constantXo?. (70.7) 


Let us now consider the neighbourhood of a saddle-point. In this case 
two of the quantities y1, y2, ys in (70.2) are positive and one negative, or 
vice versa. Instead of (70.3) we now have 


O— Wy = £E(x2 +22—x?). (70.8) 


Let us take the specific case of the upper sign. Then the constant-frequency 
surfaces for @ < wo are hyperboloids of two sheets, and those for w > wo are 
hyperboloids of one sheet; the limiting surface w = wo is a double cone 
(Fig. 9, p. 214). 

The integration in (70.4) is now conveniently performed in cylindrical 
polar coordinates in x-space; x,, x,,@, where x, = +/(x; +) and @ is the 
polar angle in the x,x,-plane. The magnitude of the gradient is | V,| = x. 
For @ < Wo the integral is taken over the two sheets of the hyperboloid: 


27m x 1 27x, dx 
d , 4 = ! = d 9 ¢ @ = aa? so ee ren * 
In = Tat Oks 8) = 7, { Vba, + 2(@)—0)] 
0 
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the upper limit K (whose value does not affect the form of the resulting 
singularity) may be taken as any value of x large compared with /(wo—©), 
but also so small that the expression (70.8) still gives the form of the constant- 
frequency surface. The result is 


1 
g(@) = int Jy [K —4/{2(wo—a)}}. 
For > ao, we similarly find 
K 
2 27H 1 dx, K 


~ QaPVvy | VPA —Ao-—a)}  2nPr/y’ 


* t min 


g(@) 


where **, sin = 2(c—o). Thus the vibration number density near the saddle- 
point is 
( £(@0)—Vloo—o|/n?-(2y) for a= ay gy 


a as \ 2(@o) for W>o. 


Here again g(w) has a square-root singularity. 

For a saddle-point with the lower sign in (70.8), the result is similar with 
the regions w < wo and @ > wo interchanged (square-root singularity for 
@ > Wo). 
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§ 71. Phonons 


Let us now consider the quantum treatment of lattice vibrations. 

Instead of the waves (69.6), in which the atoms have definite displacements 
at any instant, quantum theory uses the concept of phonons, which are 
“quasi-particles” propagated through the lattice, with definite energies and 
directions of motion. Since the energy of an oscillator in quantum mechanics 
is an integral multiple of Aw (where w is the frequency of the classical wave), 
the phonon energy « is related to the frequency w by 


e = ho, (71.1) 


in the same way as for light quanta or photons. The wave vector k determines 
the quasi-momentum p of the phonon: 


p = Ak. , (71.2) 


This is a quantity in many ways analogous to the ordinary momentum, but 
there is an important difference between them due to the fact that the quasi- 
momentum is defined only to within an arbitrary additive constant vector of 
the form fb; values of p differing by such a quantity are physically equiva- 
lent. 

The velocity of a phonon is given by the group velocity of the correspond- 
ing classical waves, Vv = 0w/0k. This formula may also be written 


v = Ge(p)/ep, (71.3) 


which is exactly analogous to the usual relation between the energy, momen- 
tum and velocity of particles. 

The whole of the discussion in §§ 69 and 70 of the properties of the classical 
vibration spectrum of a crystal lattice is entirely applicable (with the appro- 
priate change of terminology) to the phonon energy spectrum, i.e. the de- 
pendence of the phonon energy on the quasi-momentum. In particular, the 
phonon energy spectrum «(p) has 3» branches, including three acoustic 
branches. The vibration number density considered in § 70 here becomes 
the phonon quantum state number density. 

In the quantum picture, free movement of non-interacting phonons takes 
the place of free wave propagation in the harmonic approximation. In sub- 
sequent approximations, various elastic and inelastic phonon collision pro- 
cesses occur. These collisions provide the mechanism for the establishment 
of thermal equilibrium in the phonon gas, i.e. of an equilibrium thermal 
motion in the lattice. 

The laws of conservation of energy and quasi-momentum must be satisfied 
in all such processes. The latter law, however, demands the conservation of 
the total quasi-momentum of the phonons only to within any additive vector 
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of the form Ab, because of the non-uniqueness of the quasi-momentum itself. 
Thus the initial and final quasi-momenta p and p’ in any phonon collision 
process must be related by’ 


Sp = 2p’ +Ab. (71.4) 


Any number of identical phonons may be created simultaneously in the 
lattice. That is, any number of phonons may be in each of the phonon quan- 
tum states (in the classical picture, this corresponds to arbitrary wave inten- 
sity). This means that the phonon gas obeys Bose statistics. Since, further- 
more, the total number of particles in this gas is not given and is itself 
determined by the equilibrium conditions, its chemical potential is zero 
(see § 63). The mean number of phonons in a given quantum state (with 
quasi-momentum p and energy «) is therefore determined in thermal equi- 
librium by Planck’s distribution function: 


Tp = 1/(eovT—1). (71.5) 


It may be noted that at high temperatures (T > ©) this expression becomes 


Ny = T/e(p), (71.6) 


i.e. the number of phonons in a given state is proportional to the tempera- 
ture. 

The concept of phonons is a particular case of a more general one that is 
fundamental in the theory of quantum energy spectra of all macroscopic 
bodies. Any low excited state of a macroscopic body may be regarded in 
quantum mechanics as an assembly of separate elementary excitations, which 
behave as quasi-particles moving in the volume occupied by the body. So 
long as the number of elementary excitations is sufficiently small, they do not 
“interact” with one another (i.e. their energies are additive), and so the 
assembly of them may be regarded as an ideal gas of quasi-particles. It 
should again be emphasised that the concept of elementary excitations arises 
as a means of quantum description of the collective motion of the atoms in 
the body, and they cannot in any way be identified with the individual atoms 
or molecules. 

For phonons, their interaction corresponds (in the classical picture) to 
anharmonic vibrations of the atoms in the lattice. But, as already mentioned 
in § 64, these vibrations in solids are in practice always small, and are there- 
fore “almost harmonic”. The interaction of phonons in solids is therefore 
always weak in practice. 


t Processes in which the total quasi-momentum does not remain constant but 
changes by fb are called Umklapp processes. 
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In conclusion, we shall give the formulae for the thermodynamic quantities 
of a solid in terms of the phonon spectrum in it. 

The free energy of a solid in thermodynamic equilibrium is given by (64.1). 
Changing from summation to integration over a continuum of phonon states, 
we have 

3e 3 
r= NeotT {los |! ex (-==)| a (71.7) 
where the summation is over all branches of the spectrum and the integration 
over the values of k in one reciprocal lattice cell. In terms of the number 
density of states g,(w) in each branch of the spectrum, with a change to 
integration over frequencies, we can also write this formula as 


Sy 
F = Neg+TV ¥° { log (1—e—*'7) g,(w) doo. (71.8) 
asl 


The non-equilibrium macroscopic state of a solid is described by some non- 
equilibrium distribution of phonons among their quantum states, as for an 
ideal gas. The entropy of a body in such a state can be calculated by means 
of the formulae derived in § 55 (for a Bose gas). In particular, when there are 
many phonons in each state, the entropy is 


S= 3G; log (eN;/G;), 
where N, is the number of phonons in a group of G, neighbouring states (see 
(55.8)). This case corresponds to high temperatures (T > @). 

We can rewrite this formula in an integral form corresponding to the 
classical picture of thermal vibrations. The number of phonon states (of 
each branch of the spectrum) which correspond to the interval d°k of values 
of the wave vector and the space volume element dV is 


d®k dV 

(22)> 
Let U,(r, k) dr be the energy of thermal vibrations in the same phase-space 
element dr. The corresponding number of phonons is U,(r, k) dt/hw,(k). 
Substituting these expressions for G, and N, and changing to integration, 
we have the following formula for the entropy of a solid with a given non- 
equilibrium distribution of energy in the spectrum of thermal vibrations: 


drt = 


S= y J log [eU.(r, k)/fo.(k)] dr. (71.9) 


t This formula has already been used in § 68 for the free energy contribution 
from the acoustic branches of the spectrum. 
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§ 72. Phonon creation and annihilation operators . 


We shall now show how the ideas defined in § 71 occur in the consistent 
quantisation of the lattice vibrations. The formulae obtained also have 
independent significance, since they form the basis for the mathematical 
formalism in the analysis of phonon interaction events. 

Any vibrational motion of the crystal lattice can be represented as a super- 
position of travelling plane waves.‘ If the volume of the lattice is taken to be 
large but finite, the wave vector k will take a series of close but discrete values. 
The atomic displacements u,(¢, n) are then given by the discrete sum 


Lyi . 
ux(t,m) = Ty 2, 2 [aesel ajo + ees Oe], (72.1) 


where N is the number of unit cells in the lattice. The summation is taken over 
all (non-equivalent) values of k and over all branches of the vibration spec- 
trum; the remaining notation is as follows. 

The vectors e® in (72.1) are the vibration polarisation vectors, i.e. the 
amplitudes which not only satisfy equations (69.7), but are now assumed 
normalised in a particular way. The normalisation condition is written 
(together with the orthogonality relations (69.11)) as 


Yel) el" "(k) = bau (72.2) 


s= 


where m = 2m, is the total mass of the atoms in one cell. The conditions 
(72.2) still leave arbitrary the common phase factor (independent of s) 
in the vectors e®. This arbitrariness enables us to impose on the vectors the 
further conditions 


ef'(—k) = ef"(k); (72.3) 


the possibility of such a choice is evident, since by (69.10) the vectors on each 
side of equation (72.3) satisfy the same equations. 
The coefficients a,, in (72.1) are functions of time, satisfying the equations 


Gin +0(k) ake =/(); (72.4) 
which are obtained by substituting (72.1) in equations (69.4). We put | 
Ay, oC exp [—iw,(k)¢]; (72.5) 


then each term in the sum depends only on the difference k»r,—«,, i.e. is a 
wave travelling in the direction of k. 


* Exactly as for a free electromagnetic field (cf. Fields, § 52). 
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The vibrational energy of the lattice is expressed in terms of the displace- 
ments and velocities of the atoms by 
E= 4 > maya)t+s > Ag (@—n’) uy(n) uyx(n). (72.6) 
nS pn, 2B 
so 


Here we substitute the expansion (72.1). All terms in the resulting sums that 
contain factors exp[+i(ktk’)-r,] with k+k’ ~ 0 vanish on summation 
over n, since 

yet™—N for q=0, 


=0 for q#0O, 


where q takes all non-equivalent values (see § 133). Using also the conditions 
(72.2) and (72.3), we can transform the kinetic energy into 


2 moze AeaMe + $ (Ged —ke + Bia na) 
a, 


The potential energy in (72.6) can be rewritten by means of the equations of 
motion (69.4), as | 
an $ > m,ii,(n)-u,(n) 
Bn, § 


and then transformed similarly; it then takes a form that differs from the 
kinetic energy only in the sign of the second term in the braces. Addition of 
the two parts of the energy gives 


E= 2 2eneoz(k) | Qa |”. (72.7) 


Thus the total energy of the lattice vibrations is expressed as the sum of the 
energies associated with each wave separately. 

Let us now carry out a transformation of the equations of motion of the 
lattice into the canonical equations of mechanics. To do so, we use real 
“canonical variables” Q,, and P,,, defined by 


Qiu = VM (Aua + Gin)s 
Pra = — i00_(K) \/m( Ayn — ix) = Ona - 
Expressing a,, and a,, in terms of these and substituting in (72.7), we obtain 
the Hamiltonian of the lattice: 
H=% » [ Pia + w2(k)OF.].- (72.9) 


(72.8) 


The Hamilton’s equations 0H/OP,, = O,, are the same as P,, = Q,,. and 
from 0H/0Q,, = —P,, we obtain 


Duc + w2(k) Oe = 0, 


which are the equations of motion of the lattice. 
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Thus the Hamiltonian is represented as a sum of independent terms, each 
in the form of the Hamiltonian of a one-dimensional harmonic oscillator. 
This way of describing a classical vibrational motion makes evident the 
means of the transition to the quantum theory.’ We must now consider the 
canonical variables (the generalised coordinates Q,, and generalised mo- 
menta P,,) aS operators with the commutation rule 


PaO ia— Ot Pre = —ih. (72.10) 


The Hamiltonian (72.9) is replaced by a similar operator, whose eigenvalues 
are known from quantum mechanics: 


E= Y Herel) (Pcs + 2)» Me = 0,1,2... . (72.11) 


This formula enables us to define the concept of phonons as in § 71: the 
excited state of the lattice may be regarded as a set of elementary excitations 
(quasi-particles), each with an energy fim,(k) that is a definite function of the 
parameter (quasi-momentum) k. The quantum numbers n,,, then become the 
occupation numbers of the various quasi-particle states.? 

In accordance with the known properties of the harmonic oscillator in 
quantum mechanics, the quantities w,(k)Q,,+iP,, have non-zero matrix 
elements only for transitions in which the numbers n,, change by unity 
(see Quantum Mechanics, § 23). If we use the operators 


Whe. (EN [o.(k)Oxe oF iB kel 
~ (3) oe 


say [@alk) Ore = iP, .), 


ka = 


= 
JV Join (k)] 
the non-zero matrix elements are 


(Ma— 1 | Gicee| Mya) = (Mga | az | Nya 1) 
= -Vtua (72.13) 


The commutation rules for these operators are obtained from the definition 
(72.12) and the rule (72.10): 


Catia — CiCee = 1. (72.14) 


+ As in the transition from the classical description of a free electromagnetic 
field to the quantum picture of photons (see RQT, § 2). 

t The “zero-point” energy Ztho, which remains in (72.11) when all the My, = 0 
is to be included in the ground-state energy of the body. It is finite (because the 
sum contains only a finite number of terms) and its existence does not here lead 
to any difficulties of principle (in contrast to quantum electrodynamics, where the 
sum \7iw diverges). 
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It is seen from (72.13) that, when acting on functions of the occupation 
numbers, the operators é,, and é* are phonon annihilation and creation 
operators. The rule (72.14) corresponds to Bose statistics, as it should. 

Together with the quantities c,,, the displacement vectors? also become 
operators (in second quantisation): 


a(B) = Vir peatisy ene! WE + else” ee" (72.15) 


From this, the anharmonic terms in the Hamiltonian (terms of the third 
and higher powers in the displacements) can be expressed in terms of prod- 
ucts of various numbers of phonon creation and annihilation operators. 
These terms represent the perturbation that leads to various phonon- 
scattering processes, with changes of the phonon occupation numbers. 


§ 73. Negative temperatures 


Let us now consider some peculiar effects related to the properties of para- 
magnetic dielectrics. In such substances the atoms have angular momenta, 
and therefore magnetic moments, which are more or less freely oriented. The 
interaction of these moments (magnetic or exchange interaction, depending 
on their distance apart) brings about a new “magnetic” spectrum which is 
superposed on the ordinary dielectric spectrum. 

This new spectrum lies entirely within a finite interval of energy, of the 
order of magnitude of the energy of interaction of the magnetic moments of 
all the atoms of the body, lying at fixed distances apart at the crystal lattice 
sites; the amount of this energy per atom may be from tenths of a degree to 
hundreds of degrees. In this respect the magnetic energy spectrum is com- 
pletely different from the ordinary spectra, which, owing to the presence 
of the kinetic energy of the particles, extend to arbitrarily high energy 
values.+ 

Because of this property we can consider an interval of temperatures large 
compared with the maximum possible interval of energy values per atom. 
The free energy F,,,, pertaining to the magnetic part of the spectrum is cal- 
culated in exactly the same way as in § 32. 


t It is easily seen from the definitions (72.8) and (72.12) that the quantities Che 
differ from the a,, only by a factor. 


t The electronic (including the magnetic) spectra of various categories of sol- 
ids will be studied in Part 2. In this section we are considering only the purely 
thermodynamic consequences of the above-mentioned general property of the 
magnetic spectrum. 
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Let E, be the energy levels of the system of interacting moments. 7hen 
we have: for the required partition function 


Zmag = Der 


= E(1-F T +3): 


Here, as in § 32, a formal expansion in powers of the quantity E,/7, which is 
not in general small, will give (after taking logarithms) an expansion in terms 
of asmall quantity ~£,/NT, where Nis the number of atoms. The total num- 
ber of levels in the spectrum under consideration is finite and equal to the 
number of all possible combinations of orientations of the atomic moments; 
for example, if all the moments are equal, this number is g¥, where g is the 
number of possible orientations of an individual moment relative to the 
lattice. Taking the average here as the ordinary arithmetic mean, we can 
write Zing aS 


LE, als 
Finally, taking logarithms and again expanding in series with the same accu- 
racy, we obtain for the free energy the expression 


Finag = —T log Zmag 


= —NT log ¢ +E, — > (E,—E,)"). (73.1) 
Hence the entropy is 
| | Smaz = N log g— a (E,—En))» (73.2) 
the energy | ; | 
| Emus = E,— 5 (E.—E,)) (733) 
and the specific heat | 
Coons = = (E,—En))- (73.4) 


We shall regard the atomic magnetic moments fixed at the lattice sites and 
interacting with one another as a single isolated system, ignoring its interac- 
tion with the lattice vibrations, which is usually very weak. Formulae (73.1)- 
(73.4) determine the thermodynamic quantities for this system at high tem- 
peratures. 

The proof given in § 10 that the temperature is positive was based on the 
condition for the system to be stable with respect to the occurrence of internal 
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macroscopic motions within it. But the system of moments here considered is 
by its nature incapable of macroscopic motion, and so the previous argu- 
ments do not apply to it; nor does the proof based on the normalisation con- 
dition for the Gibbs distribution (§ 36), since in the present case the system 
has only a finite number of energy levels, themselves finite, and so the norma- 
lisation sum converges for any value of T. 

Thus we have the interesting result that the system of interacting moments 
may have either a positive or a negative temperature. Let us examine the 
properties of the system at various temperatures. 

At T = 0, the system is in its lowest quantum state, and its entropy is zero. 
As the temperature increases, the energy and entropy of the system increase 
monotonically. At T = ©, the energy is E, and the entropy reaches its maxi- 
mum value N log g; these values correspond to a distribution with equal 
probability over all quantum states of the system, which is the limit of the 
Gibbs distribution as T - . 

The temperature T = — o is physically identical with T=; the two 
values give the same distribution and the same values of the thermodynamic 
quantities for the system. A further increase in the energy of the system corre- 
sponds to an increase in the temperature from T = — , with decreasing 
absolute magnitude since the temperature is negative. The entropy decreases 
monotonically (Fig. 10).* Finally, at 7 = O—the energy reaches its greatest 


eee sleeve Seana onc CF) 


E 
T=O+ Teto T=0- 
Fic. 10 


value and the entropy returns to zero, the system then being in its highest 
quantum state. 

Thus the region of negative temperatures lies not “below absolute zero” 
but “above infinity”. In this sense we can say that negative temperatures are 
“higher” than positive ones. This is in accordance with the fact that, when a 
system at a negative temperature interacts with one at a positive temperature 
(i.e. the lattice vibrations), energy must pass from the former to the latter 
system; this is easily seen by the same method as that used in§ 9 to discuss the 
exchange of energy between bodies at different temperatures. 


t The curve S = S(E) is symmetrical near its maximum, but in general need not 
be symmetrical far from the maximum. 
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States with negative temperature can be attained in practice in a paramag- 
netic system of nuclear moments in a crystal where the relaxation time fg for 
- the interaction between nuclear spins is very small compared with the relaxa- 
tion time 7; for the spin-lattice interaction (E. M. Purcell and R. V. Pound, 
1951). Let the crystal be magnetised in a strong magnetic field, and let the 
direction of the field then be reversed so quickly that the spins “cannot fol- 
low it”. The system is thus in a non-equilibrium state, with an energy which 
is obviously higher than Z,. During a time of the order of f,, the system reach- 
es an equilibrium state with the same energy. If the field is then adiabati- 
cally removed, the system remains in the equilibrium state, which will clearly 
have a negative temperature. The subsequent exchange of energy between the 
spin system and the lattice, whereby their temperatures are equalised, takes 
place in a time of the order of f1. 


CHAPTER VII 


NON-IDEAL GASES 


§ 74. Deviations of gases from the ideal state 


THE equation of state of an ideal gas can often be applied to actual gases with 
sufficient accuracy. This approximation may, however, be inadequate, and it 
is then necessary to take account of the deviations of an actual gas from the 
ideal state which result from the interaction between its component mole- 
cules. ‘ 

Here we shall do this on the assumption that the gas is still so rarefied that 
triple, quadruple, etc., collisions between molecules may be neglected, and 
their interaction may be assumed to occur only through binary collisions. 

To simplify the formulae, let us first consider a monatomic actual gas. 
The motion of its particles may be treated classically, so that its energy has 
the form 


E,.g= > 2 


@=l1 2 


where the first term is the kinetic energy of the N atoms of the gas, and U is 
the energy of their mutual interaction. In a monatomic gas, U is a function 
only of the distances between the atoms. The partition function [ e~" 97 dP" 
becomes the product of the integral over the momenta of the atoms and the 
integral over their coordinates. The latter integral is 


fi... fe-UT dV... dVn, 


where the integration over each dV, = dx, dy, dz, is taken over the whole 
volume V occupied by the gas. For an ideal gas, U = 0, and this integral 
would be simply VW’. It is therefore clear that, on calculating the free energy 
from the general formula (31.5), we obtain 


F= Fig—T log oy a [ e-urar, ... dy, (74.2) 


wv ov 


where F,, is the free energy of an ideal gas. 
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Adding and subtracting unity in the integrand, we can rewrite formula 
(74.2) as 


F = Fig—T log {pw [ ys | (e-Y/T_1) V1... a+ (74.3) 


For the subsequent calculations we make use of the following formal 
device. Let us suppose that the gas is not only sufficiently rarefied but also 
so small in quantity that not more than one pair of atoms may be assumed 
to be colliding in the gas at any one time. This assumption does not affect 
the generality of the resulting formulae, since we know from the additivity 
of the free energy that it must have the form F = Nf(T, V/N) (see § 24), 
and therefore the formulae deduced for a small quantity of gas are necessarily 
valid for any quantity. 

The interaction between atoms is very smal! except when the two atoms 
concerned are very close together, i.e. are almost colliding. The integrand in 
(74.3) is therefore appreciably different from zero only when some pair of 
atoms are very close together. According to the above assumption, not 
more than one pair of atoms can satisfy this condition at any one time, and 
this pair can be selected from N atoms in +N(N— 1) ways. Consequently, the 
integral in (74.3) may be written 


4 N(N— 1) f at f (4/7 1)dV; ars dn, 


where U2 is the energy of interaction of the two atoms (it does not matter 
which two, as they are all identical); Ui2 depends only on the coordinates of 
two atoms, and we can therefore integrate over the remaining coordinates, 
obtaining V"—*. We can also, of course, write N? instead of N(N—1), since 
N is very large; substituting the resulting expression (74.3) in place of the 
integral, and using the fact that log (1-+x) x x for x < 1, we havet 


TN? 


Fs Fia— 378 


[cewwr—norrers 
where dV, dV 2 is the product of differentials of the coordinates of the two 
atoms. 

But U2 is a function only of the distance between the two atoms, i.e. of 
the differences of their coordinates. Thus, if the coordinates of the two 
atoms are expressed in terms of the coordinates of their centre of mass and 
their relative coordinates, U12 will depend only on the latter (the product of 


t We shall see later that the first term in the argument of the logarithm in (74.3) 
is proportional to N?/V. The expansion in question therefore depends on precisely 
the assumption made above, that not only the density N/V but also the quantity 
of the gas is small. 
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whose differentials will be denoted by dV). We can therefore integrate with 
respect to the coordinates of the centre of mass, again obtaining the volume 
V. The final result is . 

F = Figt+N°TB(T)/V, (74.4) 


where 
BIT) = } f{ d—e-Y7) av. (74.5) 


From this we find the pressure P = —O0F/OV: 


pees (! or}, (74.6) 


V 
since P,, = NT/V. Equation (74.6) is the equation of state of the gas in the 
approximation considered. 

As we know from the theorem of small increments (§ 15), the changes in 
the free energy and the thermodynamic potential resulting from small 
changes in the external conditions or properties of a body are equal, one 
being taken at constant volume and the other at constant pressure. 

If we regard the deviation of a gas from the ideal state as such a change, 
we can change directly to ® from (74.4). To do so, we need only express the 
volume in terms of the pressure in the correction term in (74.4) by means 


of the equation of state for an ideal gas, obtaining 
®D = Dig+ NBP. (74.7) 


The volume may hence be expressed as a function of the pressure: 


v= aT +NB. (74.8) 


The whole of the foregoing discussion applies to monatomic gases. The 
same formulae remain valid, however, for polyatomic gases also. In this 
case the potential energy of interaction of the molecules depends not only on 
their distance apart but also on their relative orientation. If (as almost always 
happens) the rotation of the molecules may be treated classically, we can say 
that U2 is a function of the coordinates of the centres of mass of the mole- 
cules and of rotational coordinates (angles) which define the spatial orienta- 
tion of the molecules. It is easy to see that the only difference from the case 
of a monatomic gas amounts to the fact that dV,, must be taken as the prod- 
uct of the differentials of all these coordinates of the molecule. But the 
rotational coordinates can always be so chosen that the integral f dV, is 
again equal to the volume V of the gas. For the integration over the coordi- 
nates of the centre of mass gives this volume V, while the integration over 
angles gives a constant, and the angles can always be normalised so that this 
constant is unity. Thus all the formulae derived in this section have the 
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same form for polyatomic gases, the only difference being that in (74.5) dV 
is now the product of the differentials of coordinates defining both the dis- 
tance between two molecules and their relative orientation." 

All the above formulae are meaningful, of course, only if the integral (74.5) 
converges. For this to be so it is certainly necessary that the forces of inter- 
action between the molecules should decrease sufficiently rapidly with 
increasing distance: U12 must decrease at large distances more rapidly than 
1/r3,4 

If this condition is not satisfied, a gas consisting of identical particles can- 
not exist as a homogeneous body. In this case every region of matter will be 
subject to very large forces exerted by distant parts of the gas. The regions 
near to and far from the boundary of the volume occupied by the gas will 
therefore be in quite different conditions, and so the gas is no longer homo- 
geneous. 

Ui2 


Fia. 11 


For monatomic gases the function U12(r) has the form shown in Fig. 11; 
the abscissa is the distance r between the atoms. At small distances, U1 in- 
creases with decreasing distance, corresponding to repulsive forces between 
the atoms; beginning approximately at the place where the curve crosses the 
abscissa axis, it rises steeply, so that Uig rapidly becomes very large, corre- 
sponding to the mutual “impenetrability” of the atoms (for which reason the 
distance ro is sometimes called the radius of the atom). At large distances, 
U2 increases slowly, approaching zero asymptotically. The increase of Ui2 
with distance corresponds to a mutual attraction of the atoms. The minimum 


* If the particles in the gas have spin, the form of the function U,, depends in 
general on the orientation of the spins. In that case a summation over spin ori- 
entations must be added to the integration with respect to dV. 


t This condition is always satisfied for atomic and molecular gases: the forces 
of interaction between electrically neutral atoms or molecules (including dipoles), 
when averaged over the relative orientations of the particles, decrease at large 
distances as Uj o¢ 1/r®; see Quantum Mechanics, § 89. 
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of U2 corresponds to a stable equilibrium. The absolute value Uo of the 
energy at this point is usually small, being of the order of the critical temper- 
ature of the substance. 

For a polyatomic gas, the interaction energy has a similar form, but it 
cannot, of course, be represented by the curve in Fig. 11, since it is a function 
of a larger number of variables. 

This information as to the nature of the function U1: is sufficient to deter- 
mine the sign of B(T) in the limiting cases of high and low temperatures. 
At high temperatures (T >> Uo) we have | U12|/T « 1 throughout the region 
r > 2ro, and the integrand in B(T)(74.5) is almost zero. Thus the value of 
the integral is mainly determined by the region r < 2ro, where U42/T is 
large and positive; in this region, therefore, the integrand is positive, and so 
the integral itself is positive. Thus B(T) is positive at high temperatures. 

At low temperatures (T < U,), on the other hand, the region r > 2ro is 
the important one in the integral, and in this region U2/T is large and nega- 
tive. At sufficiently low temperatures, therefore, B(7’) must be negative, and 
its temperature dependence is essentially given by the exponential factor 
— Volt, 

Being positive at high temperatures and negative at low temperatures, 
B(T) must pass through zero at some temperature." 

Finally, let us consider a Joule-Thomson process involving a non-ideal 
gas. The variation of temperature during the process is given by the derivative 


(er), 7 Z Ir (Sr), - v| (74.9) 


see (18.2). For an ideal gas this derivative is of course zero, but for a gas with 
the equation of state (74.8) we have 
OoT\' WN /(,,dB N Uie 

pi a ian (pci: 1) ae ipa ied —Uy/T (1 — 722) — 

(P)_ C (T 8) 5G; f ; (1 ‘ 1 dV. (74.10) 
As in the discussion of B(T), it is easy to see that at high temperatures 
(67/O0P), < 0, ie. when the gas goes from a higher to a lower pressure 
in a Joule-Thomson process its temperature rises; at low temperatures, 
(OT/OP), > 0, i.e. the gas temperature falls with the pressure. At a def- 
inite temperature for each gas, called the inversion point, the Joule- 
Thomson effect must therefore change sign.t 


t The temperature 7, for which B(T,) = 0 is called the Boyle point. If PV/T is 
plotted against P for various given 7, the isotherm T = 7, has a horizontal tan- 
gent as P - 0, and separates isotherms with positive and negative initial slopes; 
all the isotherms start from the point PV/T = 1, P = 0. 

t It will be recalled that we are considering a gas which is only slightly non-ideal, 
so that the pressure is relatively low. The result that the inversion point is independ- 
ent of pressure is valid only in this approximation; cf.§ 76, Problem 4. 
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PROBLEMS 


PRoBLEM 1. Determine B(7) for a gas whose particles repel one another accord- 
ing to Uj, = a/r"(n > 3). 


SoLUTION. In (74.5) we put dV = 4xr? dr and integrate by parts with respect 
tor from 0 to oo; the substitution x/7r" = x then reduces the integral to a gamma 


function: 
Qn [a \s" 3 
pary= (=) ri). 


PROBLEM 2. The fugacity of a gas is the pressure P* which it would have for given 
values of the temperature and chemical potential if so rarefied that it could be 
regarded as an ideal gas. Determine the fugacity of a gas with the thermodynamic 
potential (74.7). 


SOLUTION. The chemical potential of the gas is (with u;4 given by (42.6)) 
fb = Mgt BP = T log P+4(T)+ BP. 


Equating this to Tlog P*+-7(7) by the definition of the fugacity, we have to the 
same accuracy as that of (74.7) 


BP\ NT/. 2NB 
p= gil) mpi (faeces VE 
ta) he) 


§ 75. Expansion in powers of the density 


The equation of state (74.6) derived in § 74 consists essentially of the first 
two terms in an expansion of the pressure in powers of 1/V: 


NT (, , NB(L) , N°C(T) | ’. 


Pat (ltt (75.1) 


V V y 


The first term in the expansion corresponds to an ideal gas. The second term 
is obtaining by taking into account the interaction between pairs of mole- 
cules, and the subsequent terms must involve the interactions between 
groups of three, four, etc., molecules. 

The coefficients B, C, ... in the expansion (75.1) are called the second, 
third, etc., virial coefficients. To determine these quantities, it is convenien tto 
begin by calculating the potential Q, not the free energy. Let us again consid- 
er a monatomic gas, and start from the general formula (35.5), which for a 
gas consisting of identical particles becomes 


e—alT — bi i euNIT [ Em. WT Ay. (75.2) 
= N ! | 
t The dimensionless small parameter with respect to which the expansion is 
made is actually the ratio Nvo/V of the “volume” v of one molecule to the gas 
volume per molecule V/N. 
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The factor 1/N! is included and the integration is then taken simply over the 
whole phase space of the system of N particles; cf. (31.7). 

In the successive terms of the sum over N, the energy Ey(p, g) has the 
following forms. For N = 0, of course, Eo(p, g) = 0. For N = 1, it is simply 
the kinetic energy of one atom: 


Ex(p, q) = p?/2m. 


For N = 2 it consists of the kinetic energy of the two atoms and the energy 


of their interaction: 
2 


2 
Ep, q) = Pe +Ui2. 


Similarly, 


3 Pa 
E3(p, 9) = 2m + U1 


where U3 is the interaction energy of three atoms (which in general is not 


equal to the sum U12+U13+U23), and so on. 
We substitute these expressions in (75.2) and use the notation 


eT mT \8? 
3 = ale mT 3 = (se ev!T (75.3) 


We shall see below that this is simply equal to P,,/T, where P,, is the pressure 
of an ideal gas for given T and V. We obtain 


2 
Q = —T log { +EV + {feo dVidV2 


3 
+5 { { | e-UuslT dV, dV2dV3+ } 


Each of the U2, Uisg, ... is a function only of the distances between the 
atoms; hence, by using the relative coordinates of the atoms (relative to the 
first atom, say), we reduce the multiplicity of the integrals by one, with a 


further factor V entering: 
2 
Q=-PV= —T log { +4 [ e-¥wr dV, 


j 


3 
+r ([etwrars dV gt .. 7 


Finally, we expand this expression in powers of &; the resulting series can be 
written as 


p= iy Jr tn, (75.4) 
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where 
Ji,=1, Je= f (e-4#/T—-1) dV 5, 


(75.5) 
J3= f f (e~Uiss/T — g—Uis/T _ e— Ui T _ e—Ur¥T 4.2) dV2 dV, 


etc. The structure of the integrals J, is evident; the integrand in J, is appreci- 
ably different from zero only if 7 atoms are close together, i.e. in a collision 
of 7 atoms. 

Differentiating (75.4) with respect to 4, we obtain the number of particles 
in the gas, since N = —(62/d4)7, » = V(OP/Ou);,y. Bearing in mind that 
by definition (75.3) 0&/du = €/T, we have 


In 
(n—1)! 


N=VY Er. (75.6) 
n=l 
The two equations (75.4) and (75.6) give in parametric form (with para- 
meter €) the relation between P, V and T, i.e. the equation of state of the 
gas. Eliminating &, we can obtain the equation of state in the form of the 
series (75.1) with any desired number of terms.* 


§ 76. Van der Waals’ formula 


In gases the interaction between molecules is very weak. As this inter- 
action increases, the properties of the gas differ more and more from those 
of ideal gases, and finally the gas condenses into a liquid. In the latter, the 
interaction between molecules is strong, and the properties of this interaction 
(and therefore those of the liquid) depend considerably on the particular 
liquid concerned. For this reason it is, as already mentioned, impossible to 
derive any general formulae giving a quantitative description of the proper- 
ties of a liquid. 

We can, however, find an interpolation formula which gives a qualitative 
description of the transition between the liquid and gaseous states. This for- 
mula must give the correct results in the two limiting cases. For rarefied 
gases it must become the formulae valid for ideal gases. When the density 
increases and the gas approaches the liquid state, it must take account of 
the finite compressibility of the substance. Such a formula will then give 


tT In the first approximation, P = T&,N = VE, whence P = NT/V = Pq. In the 
second approximation, P = Té( 1+43J26), N= VE(1+J,§); eliminating § from 
these equations (with the same accuracy), we have 
_ANT_NT 
we gpa" 


in agreement with (74.6). 
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a qualitative description of the behaviour of the gas in the intermediate 
range also. 

To derive such a formula, let us examine in more detail the deviations from 
the ideal state at high temperatures. As in the preceding sections, we shall 
first consider a monatomic gas; by the same arguments as used previously, 
all the resulting formulae will be equally applicable to polyatomic gases. 

The type of interaction between gas atoms described in § 74 (Fig. 11) en- 
ables us to determine the form of the leading terms in the expansion of B(T) 
in inverse powers of the temperature; here we shall assume that the ratio 
U,/T is small: 


UT «1. (76.1) 


Since Uy, depends only on the distance r between the atoms, we write in 
the integral (74.5) dV = 4xr* dr. Dividing the range of integration with re- 
spect to r into two parts, we write 


2r, °° 
BT) = 2x f (1 —e—Un/T)y? dr + 2 f (1—e—Us/T)p2 dr. 

0 279 
For values of r between 0 and 2ro, the potential energy U2 is in general 
very large. In the first integral we can therefore neglect the term e~#/" in 
comparison with unity. Then this integral is equal to the positive quantity 
b = 1674/3; if ry for a monatomic gas is regarded as the radius of the 
atom, then b is four times its volume. In the second integral, we have every- 
where | U2|/T < Uo/T «1. The integrand can therefore be expanded in 
powers of U:/T as far as the first non-vanishing term. The second integral 
then becomes 


oo 


2 2a 
a [ |Uya| r? dr =e: 
2rp 
where a is a positive constant. 
Thus we find 
B(T) = b—a/T, (76.2) 


and substitution of this in (74.4) and (74.7) gives for the free energy of the 
gas 
F= Fiat N*(Tb—a)/V, (76.3) 


and for the thermodynamic potential 
® = Di4+NP(b—a/T). (76.4) 


The desired interpolation formula can be obtained from (76.3), which 
itself does not satisfy the necessarv conditions, since it does not take account 
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of the finite compressibility of the substance. In (76.3) we substitute the 
expression for F,; from (42.4); this gives 


F = Nf(T)—NT log (e/N)—NT(log V—Nb/V)—N®a/V. (76.5) 


In deriving formula (74.4) for the free energy of a gas we assumed that the 
gas, though not sufficiently rarefied to be regarded as an ideal gas, never- 
theless occupies such a large volume that we can neglect triple and higher- 
order collisions between molecules, so that the distances between molecules 
are in general considerably larger than their dimensions. We may say that 
the gas volume V is always considerably greater than Nb; hence 


log (V—Nb) = log V + log (1—Nb/V) 
= log V—Nb/V. 
Hence (76.5) may be written 
F = Nf(T)—NT log [eV —Nb)/N]—N?a/V 
= Fiqg—NT log (1—Nb/V)—N?2a/V. (76.6) 


In this form the conditions stated above are satisfied, since when V is 
large the formula for the free energy of an ideal gas is obtained, and when V 
is small the formula shows that the gas cannot be indefinitely compressed, 
since the argument of the logarithm becomes negative when V < NB: 

If the free energy is known, we can determine the gas pressure: 


NT Na 
eh ap Ve 
or 
Na 
(?+77] (V—Nb) = NT. (76.7) 


This is the required interpolation formula for the equation of state of an 
actual gas. It is called van der Waals’ equation. 

Van der Waals’ formula is, of course, only one of an infinity of possible 
interpolation formulae which satisfy the requirements stated, and there is no 
physical reason to select any one of them; van der Waals’ formula is merely 
the simplest and most convenient.' 

From (76.6) we can find the entropy of the gas: 


S = Sia+N log (1—Nb/V), (76.8) 
and thence the energy E = F+T7S: 
E = Eiga—N?al/V. (76.9) 


t In actual applications of this formula, the values of the constants a and b 
must be chosen so as to give the best agreement with experiment. The constant 5 
cannot then be regarded as four times the molecular volume, even for a mon- 
atomic gas. 
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Thus the specific heat C, = (OE/6T), of a van der Waals gas is equal to that 
of an ideal gas; it depends only on the temperature and, in particular, may be 
constant. The specific heat C, is easily seen (cf. Problem 1) to depend not 
only on the temperature but also on the volume, and so cannot be constant. 

The second term in (76.9) corresponds to the energy of interaction of the 
gas molecules; it is, of course, negative, since on average the forces between 
molecules are attractive. 


PROBLEMS 


PROBLEM 1. Find C,—C, for a non-ideal gas described by van der Waals’ for- 
mula. 


SoLuTIon. Using formula (16.10) and van der Waals’ equation, we find 


N 


Cpa 1—2Na(V—NbyP/TV? * 


PROBLEM 2. Find the equation of an adiabatic process for a van der Waals gas 
of constant specific heat C,. 


SoLuTIon. Substituting in (76.8) S;g= N log V+Ne, log T (omitting unimpor- 
tant constants) and putting S = constant, we obtain the relation (V—Nb)T“e = 
constant. This differs from the corresponding equation for an ideal gas in that 
V is replaced by V—Nb. 


ProBLeM 3. For a gas of the same kind as in Problem 2, find the change in 
temperature on expansion into a vacuum from volume V, to V4. 


SOLUTION. In an expansion into a vacuum, the energy of the gas remains cone 
stant. Thus formula (76.9), with Ej, = NC,T, gives 


Na/i 1 
T,—-T, = — (—--——}). 
~~ CS cay: 7) 


PROBLEM 4. For a van der Waals gas find the temperature dependence of the 
inversion point for the Joule-Thomson effect. 


SoLuTion. The inversion point is determined by the equation (07/OV)p = T/V 
(see (74.9)). Substitution of T from (76.7) leads to an equation which has to be 
solved simultaneously with (76.7). Algebraic calculation gives the following depend- 
ence of the inversion point on pressure: 


Tiny = (244/(1—304P/a)). 


For any given pressure P < a/3b* there are two inversion points, between which 
the derivative (O7/OP)y is positive, while outside this temperature interval it is 
negative. When P > a/3b? there are no inversion points and (07/0P)y < 0 every- 
where.t 


_ t The upper inversion point as P + 0 (Tiny = 2a/b) corresponds to the case 
considered at the end of § 74. The lower inversion point for small P may not occur 
in a gas owing to its condensation into a liquid. 
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§ 77. Relationship of the virial coefficient and the scattering amplitude 


In calculating the virial coefficients in §§ 74-76 we have used classical sta- 
tistics, as is practically always justifiable. There is, however, methodological 
interest in the problem of calculating these coefficients in the quantum case; 
and such a case may actually occur for helium at sufficiently low tempera- 
tures. We shall show how the second virial coefficient may be calculated with 
allowance for the quantisation of the binary interaction of the gas particles 
(E. Beth and G. E. Uhlenbeck, 1937). We shall consider a monatomic gas 
whose atoms have no electronic angular momentum; and having in mind the 
case of helium, we shall also suppose for definiteness that the nuclei of the 
atoms have no spin and that the atoms obey Bose statistics. 

In the approximation concerned, it is sufficient to retain only the first three 
terms in the sum over N in formula (35.3), which determines the potential Q: 


Q = —T log { +) eU—Eig)/T 1 yy corte (77.1) 


Here the £,, are the energy levels of a single atom, and the E,, are those of 
a system of two interacting atoms. Our object is to calculate only those cor- 
rection terms in the thermodynamic quantities which are due to the direct 
interaction of the atoms; the corrections due to quantum exchange effects 
exist even in an ideal gas and are given by formula (56.15), according to 
which the exchange part of the second virial coefficient is (in the case of Bose 
statistics) 

Bexch = —3(x0h? mT". (77.2) 


Thus the problem reduces to the calculation of 


Z® = Y e@e-EwT, 
n 


and from this must be subtracted the expression which would be obtained 
for two non-interacting atoms. 

The energy levels E,, consist of the kinetic energy of the motion of the 
centre of mass of the two atoms (p?/4m, where p is the momentum of that 
motion, and m the mass of an atom) and the energy of their relative motion. 
Let the latter be denoted by ¢; this component is given by the energy levels 
of a particle of mass 4m (the reduced mass of the two atoms) moving in a 
central field Ui2(r), where U2 is the potential energy of the interaction of the 
atoms. The motion of the centre of mass is always quasi-classical ; integrating 
over its coordinates and momenta in the usual manner (cf. § 42), we obtain 


ZO = Vell (mT |nh29? See, 
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If we denote by Z,,, the part of Z™ which depends on the interaction of 
the particles, we can write 2 in the form 


Q = Qig—TV eT (mT |[nh?)? Zine - 


Regarding the second term asa small correction to the first term, and express- 
ing it in terms of 7, V and N by means of formula (45.5) for the chemical 
potential of an ideal gas, we obtain for the free energy the expression 


PF = Fig-T — mT 


8N2 / ah? \ 3/2 

V ( int 

Differentiation with respect to V gives the pressure, and the required part of 
the virial coefficient that is due to the interaction of the atoms is 


Bin(T) = —8(ah?/mTY” Zint . (77.3) 


The spectrum of energy levels < consists of a discrete spectrum of negative 
values (corresponding to a finite relative motion of the atoms) and a contin- 
uous spectrum of positive values (infinite motion). We denote the former 
by ¢,; the latter may be written in the form p?/m, where p is the momentum 
of the relative motion of the atoms when the distance between them has be- 
come very large. The whole of the sum Ze'™!7 over the discrete spectrum 
appears in Z,,,; from the integral over the continuous spectrum we must sep- 
arate the part corresponding to the free motion of non-interacting particles. 
To do this we proceed as follows. 

At large distances r, the wave function of a stationary state with orbital 
angular momentum / and positive energy p?/m has the asymptotic form 
(see Quantum Mechanics,§ 33) 


_ conStant., . /(p 
= XS ( i 5) 


. Fry ln+o1), 


where the phase shifts 6, = 5,(p) depend on the specific form of the field 
U,2(r). Let us formally suppose that the range of variation of the distance r 
is bounded by some very large but finite value R. Then the momentum p can 
take only a discrete series of values given by the boundary condition that 
y = Oforr = R: 

ca 


1 
F; R-> ln+ 0, = S77, 


where s is an integer. For large R these values are very close together and the 
summation in 


ea—PtmnT 
2 
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may be replaced by an integration. To do so, we multiply nee summand (for 


a given /) by 
_17/R , dé 
doa tar) dp 


and integrate over p; the result must then be multiplied by 2/+1 (the degree 
of degeneracy with respect to orientations of the orbital angular momentum) 
and summed over /: 


2 e-PimT — ~y (21 +1) i (5 a +a e-P'mT dp, 


For particles obeying Bose statistics and having no spin, the coordinate wave 
functions must be symmetric; this means that only even values of / are pos- 
sible, and so the summation over / is over all even integers. 

In free motion, all the phase shifts 6, = 0. The expression remaining when 
6, = 0 is therefore the part of the sum which is unrelated to the interaction 
of the atoms and is to be omitted. Thus we obtain the following expression 
for Zint? 


Zint = > elo 4 5] (2/+1) = e~rin? dp, (77.4) 
R 
rf 


and the virial coefficient B = B,,.4, +B, is 
B(T) = —1(ah2/mT*? (1 + 16Zin)- (77.5) 


The phase shifts 6, determine the scattering amplitude for particles moving 
in the field Us2(r) by means of the formulat 


fO = a ¥(2/+1) (1) Pi(cos 6), 


where the P, are Legendre polynomials, and 6 the angle between the directions 
of incidence and scattering; in the present case the summation is over all 
even values of /. It is therefore possible to express the integral in (77.4) in 
terms of the scattering amplitude. By direct substitution of the expression for 
f(@) the following relation may easily be verified: 


FOES = ay Gp POON + gas [mS PSE) ao 


t See Quantum Mechanics, § 123. The cross-section for scattering into the solid- 
angle element do is | f(0)|® do. 
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The sum on the left-hand side appears in the integrand in (77.4), and on sub- 
stituting it and integrating by parts in one of the terms we find 


Zin = el ae | premmmars(O)-+ SO) dp 


zonm 


0 
. * 
+ Gna | { prema f v. fe 4) dp do. (77.6) 

If there are discrete levels in the field U12(r), then at sufficiently low temper- 
atures the dependence of B(T) on temperature will be mainly governed by 
the sum over the discrete levels, which increases exponentially with decreas- 
ing T. There may, however, be no discrete levels; then the virial coefficient 
will vary as a power of the temperature (if we bear in mind that the scattering 
amplitude tends to a constant limit as p -> 0, we easily find that at sufficiently 
low temperatures B will be determined mainly by the term B,,,,). 

It may be noted that in the case of a weak interaction, when particle colli- 
sions can be described by the Born approximation, the scattering amplitude 
is small and the third term in (77.6), which is quadratic in the amplitude, may 
be omitted. For weak interaction there are no bound states, and so the first 
term in (77.6) is also absent. Using the familiar expression for the scattering 
amplitude f (0) in the Born approximation, it is easy to see that the expression 
for F agrees exactly with (32.3) (without the quadratic term), as it should 
in this case. 


PROBLEM 
Determine the quantum correction (of the order of #”) in the quasi-classical 
case in the virial coefficient B(T) for a monatomic gas. 


SOLUTION. The correction to the classical free energy is given by formula (33.15). 
Bearing in mind that in the present case only binary interaction of atoms occurs, 
and that U;, depends only on the distance between atoms, we find 


he dU,,\" 
Boy = | (8U12\" .- tity? ay, 
cs sar | (a2) « ee 


This expression is the correction to the classical value given by (74.5). It may be 
noted that By, > 0. 


§ 78. Thermodynamic quantities for a classical plasma 


The method given in § 75 for calculating the thermodynamic quantities for a 
non-ideal gas is certainly inapplicable to a gas consisting of charged particles 
with Coulomb interaction, since the integrals which appear in the formu- 
lae then diverge. Such gases must therefore be treated separately. 

{7 
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Let us consider a completely ionised gas or plasma. The charges on its 
particles will be denoted by z,e, where the suffix a refers to the different kinds 
of ion; e is the unit charge and the z, are positive or negative integers. Also, 
let n,9 be the number of ions of the ath kind per unit volume of the gas. The 
gas as a whole is, of course, electrically neutral, so that 


» ZaNgo = 0. (78.1) 


We shall suppose that the gas does not deviate greatly from the ideal state. 
For this to be so it is certainly necessary that the mean energy of the Coulomb 
interaction between twoions(~ (ze)’/r, where r ~ n— is the mean distance 
between ions) should be small compared with the mean kinetic energy of the 
ions (~ T). Thus we must have (ze) n* « T or 


n <« (T/z*e?)°, (78.2) 


Since the plasma is electrically neutral, the mean value of the Coulomb 
interaction energy of its particles would be zero if they were all uniformly 
and independently distributed in space. The first correction terms in the 
thermodynamic quantities for a plasma (in comparison with the values for an 
ideal gas) therefore arise only when the correlation between the positions of 
the various particles is taken into account. As a reminder of this, we shall 
call these the correlation corrections. 

Let us first determine the correction E,,,, to the plasma energy. As we 
know from electrostatics, the electrical interaction energy of a system of 
charged particles can be written as half the sum of the products of each charge 
and the potential of the field at the position of that charge due to all the 
other charges. In the present case 


corr = VF ye €ZaNa0Pa ’ (78.3) 


where ¢, is the potential of the field acting on an ion of the ath kind due to 
the other charges. To calculate these potentials we proceed as follows.’ 
Each ion creates around itself a non-uniformly charged ion cloud, which on 
average is spherically symmetrical. In other words, if we select any particular 
ion in the gas and consider the density of distribution of the other ions rela- 
tive to that ion, this density will depend only on the distance r from the centre. 
Let the density of distribution of ions (of the ath kind) in this ion cloud be 
denoted by 7,. The potential energy of each ion of the ath kind in the electric 
field around the ion considered is z,e¢, where ¢ is the potential of this field. 


t This method was used by P. Debye and E. Hiickel to calculate the thermody- 
namic quantities for strong electrolytes (1923). 
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Hence Boltzmann’s formula (38.6) gives 


Nq = Noe —Zatd!T (78.4) 


The constant coefficient is put equal to n,9, since at a large distance from the 
centre (where @ — 0) the density of the ion cloud must become equal to the 
mean ion density in the gas. 

The potential ¢ of the field in the ion cloud is related to the charge density 
in it (equal to Lez,n,) by the electrostatic Poisson’s equation: 


Ad =—Aney Zana. (78.5) 


Formulae (78.4) and (78.5) together give the equations of the self-consistent 
electric field of the electrons and ions. 

With the above assumption that the interaction of the ions is relatively 
weak, the energy ez,@ is small in comparison with T, and formula (78.4) may 
be written in the approximate form 


Ng = Mao d. (78.6) 


Substituting this in equation (78.5) and using the condition (78.1) for the 
gas to be neutral as a whole, we obtain 


Agd—xo = 0, (78.7) 
where 
x? = ae Y) 14022 (78.8) 


The quantity x has the dimensions of reciprocal length. 

The spherically symmetric solution of equation (78.7) is ¢ = constant 
e~**/r. In the immediate neighbourhood of the centre, the field must become 
the purely Coulomb field of the charge considered; this charge will be de- 
noted by ze. In other words, for sufficiently smallr we must have ¢ ez,/r. 
This shows that the constant must be taken as z,e, and so the required poten- 
tial distribution is given by 

od = eze—*"/r. (78.9) 


Hence we see, incidentally, that the field becomes very small at distances large 
compared with 1/x. The length 1/x can therefore be regarded as determining 
the dimensions of the ion cloud due to a given ion; it is also called the Debye 
length. All the calculations given here assume, of course, that this length is 
large in comparison with the mean distances between ions. This condition 
is clearly identical with (78.2). 


174 
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Expanding the potential (78.9) in series for small xr, we have 


€Z5 
o= area alan beet 


The terms omitted vanish when r = 0. The first term is the Coulomb field of 
the ion itself; the second term is clearly the potential produced by all the 
other ions in the cloud at the point occupied by the ion considered, and is the 
quantity to be substituted in formula (78.3): @, = —ez,x. 
Thus we have the following expression for the correlation part of the 
plasma energy: 
1 is 3/2 
ee = 5 Vue » Ngo = —Ve§ Vz (y na?!) ; (78.10) 


or, in terms of the total numbers of different ions in the gas N, = nV, 


eer / a (5 Nal) (78.11) 


This energy is inversely proportional to the square root of the temperature 
and to that of the volume of the gas. 

Integrating the thermodynamic relation E/T? = —(0/0T) (F/T), we can 
deduce from E,,,, the corresponding change in the free energy: 


3/2 
a Fa a (= N,#) (78.12) 


the constant of integration must be taken as zero, since when T + we 
must have F = F;,. Hence the pressure is 


_NT_ é 7 Ar 
fe VY ~ 3p3R \z (x Ne#2) 9 (78.13) 


where N=2N,. The thermodynamic potential © can be obtained from F by 
means of the theorem of small increments (as in § 74), i.e. by regarding the 
second term in (78.12) as a small increment to F,, and expressing it with the 
appropriate accuracy in terms of P and T:* 


= 0,4 (F) (z wat)”. (78.14) 


t This method was not available for going from (78.11) to (78.12), since the 
energy (78.11) was not expressed in terms of the requisite variables S and V. 
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§ 79. The method of correlation functions 


The advantage of the Debye-Hiickel method described in § 78 lies in its 
simplicity and physical clarity. Its basic drawback, however, is that it cannot 
be generalised to calculate further approximations with respect to the concen- 
tration. We shall therefore also give a brief description of another method 
(proposed by N. N. Bogolyubov, 1946), which, though more complicated, 
allows in principle the calculation of further terms in the expansion of the 
thermodynamic quantities. 

This method is based on a consideration of what are called correlation 
functions between the simultaneous positions of several particles at given 
points in space. The simplest and most important of these is the binary corre- 
lation function w,,, which is proportional to the probability of finding two 
particles (ions) simultaneously at given points r, andr,; the ions a and b may 
be of either the same or different kinds. Because the gas is isotropic and ho- 
mogeneous, this function of course depends only on r=|r, —r,|. We choose 
the normalisation coefficient in the function w,, such that this function 
tends to unity as r— oo, 

If the function w,, is known, the required energy £,,,, can be obtained 
by integration, using the obvious formula’ 


1 
| eae ap 2, NaN» [wast dV, dV;,, (79.1) 


where the summation is over all the kinds of ions, and u,, is the Coulomb 
interaction energy of a pair of ions at distance r. 
According to the Gibbs distribution formula, the function w,, 1s given by 


1 F—Fi4—-U 
Wab = y= exp = dVidV2...dVn—e, (79.2) 


where U is the Coulomb interaction energy of all the ions, and the integration 
is over the coordinates of all the ions except the two considered. For an ap- 
proximate calculation of this integral we proceed as follows. 

We differentiate equation (79.2) with respect to the coordinates of ion b: 


OWan Ss Wab Olen —s I OUbe 
Ors = T Or, TV Ne [ So ae Me (79.3) 


v 


t This formula itself is, of course, independent of the Coulomb nature of the 
interaction between the particles, and assumes only a binary interaction. 
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where the summation in the last term is over all the kinds of ions, and w,,, is 
the ternary correlation function, defined by 


1 F—Fiq—U 
Wobe = pros [ exp | dV, dV, ease dV n_3 


analogously to (79.2). 

Assuming the gas sufficiently rarefied and considering only the first-order 
terms, we can express the ternary correlation function in terms of binary cor- 
relations: neglecting the possibility that all three ions are close together, we 
have W,4. = WasWaW,- In the same approximation we can suppose that even 
the pairs of particles are not so close together that the w,, are appreciably 
different from unity. We define the small quantities 


Dab = Wab— | (79.4) 
and write 
Wabe = Dab + Mpc +Wac + 1, (79.5) 


neglecting the higher powers of the ,,. 

When this expression is substituted in the integral on the right-hand side 
of (79.3), only the term in @,, remains; the others give zero identically, be- 
cause of the isotropy of the gas. In the first term on the right of (79.3) it is 
sufficient to put w,, = 1. Thus 


OWap aA 1 OUgp Ube 


1 
or, ~ =F or, ~ FEN | Om or, eh: 


We now take the divergence of both sides of this equation, using the facts 
that u,, = z,z,e/r, r=r,—r,, and the well-known formula A(I/r) = 
—4z 6(r). The integration is then trivial because of the presence of the delta 
function, and we have 


4ne*z, 
TV 


4nz,Z,e" 
T 


AWay(K) = o(r) + 2, Nezetae(). (79.6) 


The solution of this system of equations can be sought in the form 
Was(t) = ZqZ00(r), (79.7) 


whereby they are reduced to a single equation 
Aw(r)—x*o(r) = ane d(r), (79.8) 


This final equation has the same form as equation (78.7) in Debye and 
Hiickel’s method; the term containing the delta function in(79.8) corresponds 
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to the boundary condition as r ~ 0 imposed on the function ¢(r) in (78.7). 
The solution of equation (79.8) is 

-£ ener 


or) = ; (79.9) 


r 


and this determines the binary correlation functions in the plasma. 

To calculate the energy, it is now sufficient to substitute w,, from (79.4), 
(79.7) and (79.9) in (79.1). Changing to integration with respect to the rela- 
tive coordinates of the two particles, we find 


ZaZpee Zaye? 
Foor =~ xv p> NgMp fae = Fre e~* Ager? dr; 


the term 1 in w,, makes no contribution to the energy because of the electri- 
cal neutrality of the plasma. On carrying out the integration, we return to 
the previous result (78.11). 

In the next approximation the calculations become more laborious. In par- 
ticular, the assumption (79.5) is now insufficient, and ternary correlations 
which do not reduce to binary ones must be introduced. For these we obtain 
an equation analogous to (79.3) but involving quaternary correlations; in this 
(the second) approximation the latter reduce to ternary ones.‘ 


§ 80. Thermodynamic quantities for a degenerate plasma 


In the theory given in§ 78, it was assumed that the plasma is far from degen- 
eracy, i.e. obeys Boltzmann statistics. Let us now consider a situation where 
the plasma temperature is so low that its electron component is degenerate: 


2 
Ts ae (80.1) 


where m is the electron mass (cf. (57.8)); the ion component may still be far 
from degeneracy, because of the large mass of the ions. It will be recalled 
that the condition for the plasma to be only slightly non-ideal is 


mz*8e2 [fens <« | (80.2) 


(see (57.9)); this is better satisfied at high plasma densities. 
For a degenerate gas, convenient variables are (together with the tempera- 
ture T and the volume V) its chemical potentials u, instead of the particle 


t The terms of next higher order in the thermodynamic quantities for a plasma 
have in fact been calculated (using a different method) by A. A. Vedenov and A. I. 
Larkin. Soviet Physics JETP (36) 9, 806, 1959. 
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numbers N,.‘ Accordingly, we shall calculate 2, the thermodynamic poten- 
tial with respect to these variables. Note that the chemical potentials are not 
all independent variables here: they are connected by a relation that follows 
from the electrical neutrality of the plasma, 


Vizag = ¥ 202/Ou_ = 0. (80.3) 


(a _ (of a0) 

TVs be ~ \ oA 
expresses the derivative of 2 with respect to a parameter A-in terms of the 
mean value of the corresponding derivative of the Hamiltonian of the sys- 
tem; cf. the analogous formulae (11.4), (15.11). In the present case, we take 
as the parameter A the squared charge e?. The plasma Hamiltonian contains 


e* as a common coefficient in the operator U of the Coulomb interaction of 
the particles. Hence 


The formula 


O22 on 
2), ee me 


so that the calculation of 2 is equivalent to that of the mean value (0). 

We shall see that, in a degenerate slightly non-ideal plasma, the main 
contribution to the corrections to the thermodynamic quantities for an ideal 
gas comes from the exchange part of the electrical interaction of the electrons 
(which in the classical case is unimportant and was ignored in§ 78). We there- 
fore retain in the operator U only the terms that describe the Coulomb inter- 
action of the electrons. 

The calculation of (0) is most simply carried out by the second quantisation 
method. According to this (see Quantum Mechanics, §§ 64, 65), we define a 
set of normalised wave functions y,, which describe the states of free elec- 
trons moving ina volume V w'th momenta p and spin components 0 (= +4). 
The momentum p takes an infinity of discrete values, the intervals between 
which tend to zero as V -+ oo. We also define the operators 4,, and 4%, of 
annihilation and creation of electrons in the state y,,, and use them to 
construct the p-operators 


p= » Protha ot = » ya. (80.5) 


The Coulomb interaction of the particles is a binary interaction; the operator 
of such an interaction in the second quantisation method is written as the 


t See § 85 for the definition of the chemical potentials of components of a 
mixture. 
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integral 


ee 7 | { ODOC) = HE) HE) ide. (80.6) 


ne —Fe 


The required averaging of this operator is done in two stages: first an 
averaging for a specified quantum state of the system and then an averaging 
over the equilibrium statistical distribution among the various quantum 
states. In a slightly non-ideal plasma, U acts as a small perturbation. We shall 
calculate the mean value of this quantity in the first approximation of pertur- 
bation theory, i.e. with respect to the states of a system of non-interacting 
particles (an ideal gas). 

The quantum averaging amounts to taking the appropriate diagonal 
matrix element. After substitution of the y-operators (80.5), the operator 
(80.6) is a sum of terms containing various products of four creation and 
annihilation operators: 


oS 52 (pipe | U2 | D1P2)dp; 0, 4p 3014p ,ox4p10, > (80.7) 


where the summation 1s taken over all momenta and spin components, and 
the <p.p, | Us| pip.) are the matrix elements of the interaction energy of two 
electrons, Uy, = e?/|r1—ra]; since the Coulomb interaction is independent 
of the spins, these matrix elements are taken for transitions without change 
of thé electron spin component, i.e. can be calculated from the purely orbital 
functions 


1 
Yo = Wwe 


Of all the terms in the sum (80.7), the only ones that have non-zero diago- 
nal matrix elements are those which contain two pairs of operators 4,,, 
a;, with the same suffixes, and the product 4+4,, is replaced simply by the 
occupation number of the relevant quantum state of the electrons.’ Putting 
P; = Pi, P. = py, we have the terms 


dVidV >. 
a2 ———— 8 
2V? woe me, area { lri—re| . ° ) 
and putting p; = pe Pe = Py, 0, = 0», the terms 
e dVidV, 
pee, (P1—Ps)- (1-1 )/A ‘ 
ay? we y Ny, ol lpo [e Da ) tra! ; (80.9) 


d 


+ The number of terms containing products of four operators with the same 
suffixes is infinitesimal compared with the number having two pairs of like suffixes, 
and they may therefore be omitted (the contribution of such terms to 2 would 
contain an extra factor 1/V). 
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the minus sign arises from the interchange of the operators 4¥, and d, , 
that is needed to bring the product 4¥ ay 4, 4), to the form 47,4, a7 4, , 
(for fermions, these operators anticommute). 

The terms (80.8) give just the energy of the direct Coulomb interaction of 
electrons uniformly distributed in space. As already noted in § 78, because 
of the electrical neutrality of the plasma these terms in fact cancel exactly 
the corresponding terms giving the energy of interaction of the other parti- 
cles (ions) with one another and with the electrons (and for this reason the 
divergence of the integral in (80.8) is unimportant). The terms (80.9), which 
contain non-diagonal matrix elements of the Coulomb potential, express the 
required exchange effect.’ 

Bearing in mind that with a macroscopic volume V the electron momenta 
have an almost continuous sequence of values, we can change from summa- 
tion over pi and pg to integration over V? d3p; d°p2/(22h)*; the limitation 
P1 ~ Pz then becomes unimportant. The integral in (80.9) ist 


Vf elor—pn rin dV /r = V -Anch?/(pi— ps2)” . 


The expression (80.9) then becomes 


Ome? Myo?no ps dpa 
oe Bulices: nye” 


The statistical averaging of this expression is taken (in the approximation 
considered) over the equilibrium distribution for an ideal gas. Because the 
particles of an ideal gas in different quantum states are statistically independ- 
ent, we then have 


(Np,o!p,0) a Noro Mpa} 
the mean values on the right are given by the Fermi distribution 
Ting = [e@—¥/T 4. 1]-1, 


where u, is the chemical potential of the electrons. Lastly, since the resulting 
expression is simply proportional to e?, according to (80.4) it gives 


t For a better understanding of the structure of the terms (80.8) and (80.9), 
it may be noted that in the former the pairs of operators 4,, and 4+, with the same 
indices come from ‘p-operators taken at the same point in space (r, or r,); in (80.9) 
they come from y-operators taken at different points. 

t Here we make use of the familiar expression for the Fourier component of 
the Coulomb potential: 


fer dV ir = 4rc/k?; 


see the footnote to § 117, Problem 3. 
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immediately the required correction to the thermodynamic potential of 
the plasma: 
47e? Mp, Mp, 93p1d3pe 
exe =——--V prea 2 ks Ope Se 80.10 
Sev [| eee Gay more 

(E. Wigner and F. Seitz, 1934). 

In the limit of a highly degenerate electron gas (T « f’n’?/m), the distri- 
bution 1, becomes a step function (m, = 1 for p< pp, = 0 for p> pr). 
The calculation of the integral then gives* 


Queer = —Verps|4r8h4 = —Verm*p2/r8ht. (80.11) 


This quantity, if the chemical potential in it is expressed in terms of the 
number density of electrons n, = N,/V according to (57.3), gives the cor- 
rection to the free energy: 
34/3 
F ck = —Ne aa e*nis = (80.12) 
In the opposite limiting case of a Boltzmann gas (u, < 0, |u,| > T), a 
calculation from formula (80.10) givest 
em]? 
4n7h4 


Qexch = —V erudT (80.13) 


or, expressing 4, in terms of n, according to (46.1a), 


re*h*n2 
2mT ~ 


p= {{ Sridirs Py, Pa =p 
Gan Pe 


is converted by the substitutions p,—p, = q, +P, +p.) = sinto/= ff q”? d°q d's, 
taken over the range | st$a| < pr. The integral f d°s (for a given q) is the volume 
between two spheres of radius pr whose centres are at a distance q apart: 


400 1 
[os = = PGpr-), h= prp->4.- 


Fexch = —V (80.14) 


t The integral 


2 


Integrating over q from 0 to 2pp, we get 1 = 4n°pt. 
t In this case 


Np, Np, = XP 


— | be Aes 


T mT 
_ 2u, 4s%+¢2 
= x | T  dmT 


and the integration over d°s d3q extends to the whole of q-space and s-space. 
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When T ~ p,, the exchange correction F,,., ~ Ve*n“®, whereas the 
correlation correction found in § 78 is F.,., ~ Ven®®/T™; from the condi- 
tion for slight non-ideality, 


Feorr/Fexch ™ (e* ni | Tyin « |, 


ie. the electron exchange correction is in fact the principal one. As the tem- 
perature increases, however, F,,., decreases more rapidly than F,,,. (for 
T >> bey Fen © T~*, but Foo I~"). There is consequently a region in 
which the two contributions are of the same order of magnitude, but in 
this region the degeneracy of the plasma is already slight and the classical 
formulae (78.11)-(78.14) are therefore applicable for the correlation cor- 
rection." 

In the foregoing discussion we have assumed that the ion component of 
the plasma is not degenerate and also that it is nearly ideal, i.e. that the ion 
interaction energy is small in comparison with the ion thermal energy: 
ne? << T.1 If the plasma density is not too large, 

meh? <« nl’ « F&M /h*, (80.15) 
where M is the ion mass, the temperature 7’ ~ ne? exceeds the ion degener- 
acy temperature: 

T ~ &nl3 = hr /M (80.16) 
(and T « e*M/h*). Under these conditions, the ion component forms a 
non-degenerate but far from ideal system. Then an ordered configuration 
of the nuclei corresponds to the minimum interaction energy of the ions 
with one another and with the electrons, i.e. the nuclei form a crystal lattice 
(A. A. Abrikosov, 1960). This leads to the result that the energies of the direct 
Coulomb interaction of various particles do not completely balance. In each 
lattice cell, the ion field is balanced by the electrons in the cell, but the 
interaction energy of the particles within one cell (dimensions ~ n~'*) is not 
zero. A rough estimate of this energy is ~ e*n"*, and for the whole !attice 
(with N ~ Vn cells) the binding energy is 


|Eiats| ~ Ne2n'8 ~ Vern, (80.17) 


This agrees in order of magnitude with the exchange energy of the degenerate 
electron component of the plasma. For a stable lattice the binding energy is, 
of course, negative.’ 


t The question of calculating the correlation correction for any degree of degen- 
eracy of the electrons has some methodological interest, nevertheless. This will 
be discussed in Part 2. 

+ In this and the following estimates we put for simplicity z = 1 (a hydrogen 
plasma). 

8 For a quantitative calculation of the lattice binding Snetey see A. A. Abri- 
kosov, Soviet Physics JETP 12, 1254, 1961. 


CHAPTER VIII 


PHASE EQUILIBRIUM 


§ 81. Conditions of phase equilibrium 


THE (equilibrium) state of a homogeneous body is determined by specifying 
any two thermodynamic quantities, for example the volume V and the energy 
E. There is, however, no reason to suppose that for every given pair of values 
of V and E the state of the body corresponding to thermal equilibrium will be 
homogeneous. It may be that for a given volume and energy in thermal 
equilibrium the body is not homogeneous, but separates into two homoge- 
neous parts in contact which are in different states. 

Such states of matter which can exist simultaneously in equilibrium with 
one another and in contact are described as different phases. 

Let us write down the conditions for equilibrium between two phases. 
First of all, as for any bodies in equilibrium, the temperatures 7; and 72 of 
the two phases must be equal: 


T1=Ts2. 
The pressures in the two phases must also be equal: 


Pi = Po, 


since the forces exerted by the two phases on each other at their surface of 
contact must be equal and opposite. Finally, the chemical potentials of the 
two phases must be equal: 


Hi = bas 


this condition is derived for the two phases in exactly the same way as in§ 25 
for any two adjoining regions of a body. If the potentials are expressed as 
functions of pressure and temperature, and the common temperature and 
pressure are denoted by T and P, we have 


UP, T) = bal P, T), (81.1) 


whence the pressure and temperature of phases in equilibrium can be ex- 
pressed as functions of each other. Thus two phases can not be in equilibrium 


231 
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with each other at all pressures and temperatures; when one of these is given, 
the other is completely determined. 

If the pressure and temperature are plotted as coordinates, the points at 
which phase equilibrium is possible will lie on a curve (the phase equilibrium 
curve), and the points lying on either side of the curve will represent homo- 
geneous states of the body. When the state of the body varies along a line 
which intersects the equilibrium curve, the phases separate at the point of 
intersection and the body then changes to the other phase. It may be noted 
that, when the state of the body changes slowly, it may sometimes remain 
homogeneous even when the phases should separate in complete equilibrium. 
Examples are supercooled vapours and superheated liquids, but such states 
are only metastable. 


Fic. 12 


If the equilibrium of phases is plotted in a diagram with temperature and 
volume (of a fixed quantity of matter) as coordinates, then the states in 
which two phases exist simultaneously will occupy a whole region of the 
plane, and not simply a curve. This difference from the (P, 7’) diagram arises 
because the volume V, unlike the pressure, is not the same for the two phases. 
The resulting diagram is of the kind shown in Fig. 12. Points in the regions I 
and II on either side of the hatched area correspond to homogeneous first 
and second phases. The hatched area represents states in which the two 
phases are in equilibrium: at any point a the phases I and II are in equilib- 
rium, with specific volumes given by the abscissae of the points | and 2 
which lie on a horizontal line through a. It is easily deduced directly from the 
mass balance that the quantities of phases I and II are inversely proportional 
to the lengths of the segments al and a2; this is called the Jever rule. 

In a similar way to the conditions for equilibrium of two phases, the 
equilibrium of three phases of the same substance is governed by the equa- 
tions 


Pi = Po=P3, T1=Te=Ts, wi = be= Ms. (81.2) 


§ 81 Conditions of Phase Equilibrium 253 


If the common values of the pressure and temperature of the three phases are 
again denoted by P and T, we have the conditions 


Lal P, T) = pl P, T) ma La P, T). (81.3) 


These give two equations in the two unknowns P and T, and their solutions 
are specific pairs of values of P and T. The states in which three phases are 
simultaneously present (called triple points) in the (P, T) diagram are repre- 
sented by isolated points which are the points of intersection of the equilib- 
rium curves of each pair of phases (Fig. 13, where regions I, II, IIT are those 
of the three homogeneous phases). The equilibrium of more than three pha- 
ses of the same substance is clearly impossible. 


Fic. 13 Fic. 14 


In the (T, V) diagram the neighbourhood of the triple point has the appear- 
ance shown in Fig. 14, where the hatched areas are those of equilibrium of 
two phases; the specific volumes of the three phases in equilibrium at the 
triple point (at the temperature T;,) are given by the abscissae of the points 
13, 

The change from one phase to another is accompanied by the evolution or 
absorption of a certain quantity of heat, called the /atent heat of transition 
or simply the heat of transition. According to the conditions of equilibrium 
such a transition occurs at constant pressure and temperature. But in a pro- 
cess occurring at constant pressure the quantity of heat absorbed by the body 
is equal to the change in its heat function. The heat of transition g per mole- 
cule is therefore 


q = We-W1, (81.4) 


where w, and wz¢are the heat functions per molecule of the two phases. The 
quantity qg is positive if heat is absorbed by the body in changing from the 
first to the second phase, and negative if heat is evolved. 

Since, for bodies consisting of a single substance, py is the thermodynamic 
potential per molecule, we can write u = e—Ts-+Pv (where «, s, v are the 
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molecular energy, entropy and volume). The condition uw; = us therefore 
gives 
(€,— &) —T (5, — 51) + P(g — 04) = (W, — Wy) — T (5, — 51) =, 
where T and P are the temperature and pressure of both phases; hence 
q=T(S_—5;) (81.5) 


We may note that this formula also follows directly from g = { T ds with T 
constant; the latter formula is applicable here, since the transition is revers- 
ible: the two phases remain in equilibrium during the transition process. 


Be 


Fie. 15 


Let the two curves in Fig. 15 represent the chemical potentials of the two 
phases as functions of temperature (at a given pressure). The point of inter- 
section of the two curves gives the temperature J) for which (at the given 
pressure) the two phases can exist in equilibrium. At any other temperature 
only one or the other phase can exist. It is easy to see that at temperatures 
below To the first phase exists, i.e. is stable, and at temperatures above To 
fhe second phase. This follows because the stable state is the one where u 
is smaller, since the thermodynamic potential tends to a minimum for given 
Pand T. At the point of intersection of the two curves, the derivative 041/0T 
is greater than 012/07, i.e. the entropy of the first phase, 5, = —Oy,/0T, is 
less than that of the second phase, sz = —O2/OT. The heat of transition 
q = T(se—5}) is therefore positive. Thus we conclude that, if the body goes 
from one phase to another when the temperature is raised, heat is absorbed 
in the process. This result could also be derived from Le Chatelier’s prin- 
ciple. 


PROBLEMS 


PROBLEM 1. Determine the temperature dependence of the saturated vapour 
pressure above a solid. The vapour is regarded as an ideal gas, and both the gas 
and the solid have constant specific heats. 


SOLUTION. The chemical potential of the vapour is given by formula (43.3) and 
that of the solid by (65.6); since the saturated vapour pressure is relatively small, 
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the quantity PV may be neglected for the solid, and ® taken as equal to F. Equat- 
ing the two expressions, we find 


P = constant T@p2~welos ~802)/T 


where the suffix 1 refers to the solid and 2 to the vapour. 
In the same approximation, the heat function of the solid may be taken as equal 
to its energy; the heat of transition (heat of sublimation) g = w.—wy, is 


q= (Cpg—ep)T + (€92— Eo): 


In particular, for T = 0 the heat of transition is qg = &)2— 1, so that we can 
write 
P = constant xT @p:—We—lT, 


PROBLEM 2. Determine the rate of evaporation from a condensed state into a 
vacuum. 


SOLUTION. The rate of evaporation into a vacuum is determined by the number of 
particles which leave unit surface area of the body per unit time. Let us consider 
a body in equilibrium with its saturated vapour. Then the number of particles 
leaving the surface is equal to the number which strike and “adhere to” this surface 
in the same time, i.e. Pg(1— R)/4/(21mT), where Py = P,(7) is the saturated vapour 
pressure, and R a mean reflection coefficient for gas particles colliding with the 
surface (see (39.2)). If Py is not too large, the number of particles leaving the surface 
of the body is independent of whether there is vapour in the surrounding space, 
so that the above expression gives the required rate of evaporation into a vacuum. 


§ 82. The Clapeyron—Clausius formula 


Let us differentiate both sides of the equilibrium condition ui(P,T) = 
xP, T) with respect to temperature, bearing in mind, of course, that the 
pressure P is not an independent variable but a function of temperature 
determined by this same equation. We therefore write 


Cis MISE Otte: CUBES 
or oP dT or' oP dT’ 


since (Qu/8T)p = —S, (Ou/AP)7 = v (see (24.12)), this gives 
dP = 5,—S, 


> 
dT v,—%, 


(82.1) 


where $1, ¥1, S2, Vg are the molecular entropies and volumes of the two 


phases. 
In this formula the difference s;—s2 may conveniently be expressed in 
terms of the heat of transition from one phase to the other. Substituting 


18 
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q = T(s2—51), we obtain the Clapeyron—Clausius formula: 


dP q 
dr _ T(v2—11) : ee) 


This gives the change in the pressure of phases in equilibrium when the 

temperature changes or, in other words, the change in pressure with tem- 

perature along the phase equilibrium curve. The same formula written as 
aT T(ve—1) 


dP q 


gives the change in the temperature of the transition between phases (e.g. 
freezing point or boiling point) when the pressure changes. Since the molec- 
ular volume of the gas is always greater than that of the liquid, and heat is 
absorbed in the passage from liquid to vapour, it follows that the boiling 
point always rises when the pressure increases (d7/dP > 0). The freezing 
point may rise or fall with increasing pressure, according as the volume 
increases or decreases on melting.’ 

All these consequences of formula (82.2) are in full agreement with Le 
Chatelier’s principle. Let us consider, for example, a liquid in equilibrium 
with its saturated vapour. If the pressure is increased, the boiling point must 
rise, and so some of the vapour will become liquid; this in turn will cause a 
decrease in pressure, so that the system acts as if to oppose the interaction 
which disturbs its equilibrium. 

Let us consider the particular case of formula (82.2) which relates to equi- 
librium between a solid or liquid and its vapour. Then formula (82.2) deter- 
mines the change in the saturated vapour pressure with temperature. 

The volume of a gas is usually much greater than that of a liquid or solid 
containing the same number of particles. We can therefore neglect the volume 
v1 in (82.2) in comparison with v2 (the second phase being taken to be a gas), 
ie. write dP/dT = g/Tve. Regarding the vapour as an ideal gas, we can 
express its volume in terms of the pressure and temperature by v2 = T/P; 
then dP/dT = gP/T?, or 


dlog P/dT = q/T?. (82.3) 


We may note that, in temperature intervals over which the heat of transition 
may be regarded as constant, the saturated vapour pressure varies exponen- 
tially with the temperature (oc e~*/7), 


t The liquid helium isotope He? is an exception. Its heat of fusion is negative 
over a certain temperature range. 
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PROBLEMS 


PROBLEM 1. Determine the specific heat of a vapour along the equilibrium curve 
of the liquid and its saturated vapour (i.e. the specific heat for a process in which 
the liquid is always in equilibrium with its saturated vapour). The vapour is re- 
garded as an ideal gas. 


SOLUTION. The required specific heat h = T ds/d7, where ds/dT is the deriva- 
tive along the equilibrium curve: 


ds Os Os\ dP Ov\ dP 
= T—_= Tj — Ti—|) —=c,—Ti—|) —. 
oar (gr),+ (a). ar? (ar) aT 
Substituting the expression given by (82.3) for dP/dT, and v = T/P, we find 
h= e,—q/T. 
At low temperatures, / is negative, i.e. if heat is removed in such a way that the 


vapour is always in equilibrium with the liquid, its temperature can increase. 


PROBLEM 2. Determine the change in the volume of a vapour with temperature 
in a process where the vapour is always in equilibrium with the liquid (i.e. along 
the equilibrium curve of the liquid and its vapour). 


SOLUTION. We have to determine the derivative dv/dT along the equilibrium 
curve: 
dv (sr) + (5) dP 
adr \OT), \OP),dT 
Substituting from (82.3), and v = 7/P, we find 
dv il 1 q 
dT P| ae i 5 


At low temperatures dv/dT < 0, i.e. the vapour volume decreases with increas- 
ing temperature in the process considered. 


§ 83. The critical point 


The phase equilibrium curve (in the PT-plane) may terminate at a certain 
point (Fig. 16), called the critical point; the corresponding temperature and 
pressure are the critical temperature and the critical pressure. At tempera- 
tures above JT, and pressures higher than P,, no difference of phases exists, 
the substance is always homogeneous, and we can say that at the critical 
point the two phases become identical. The concept of the critical point was 
first used by D. I. Mendeleev (1860). 
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In the coordinates T, V, when there is a critical point, the equilibrium 
diagram appears as in Fig. 17. As the temperature approaches its critical 
value, the specific volumes of the phases in equilibrium become closer, and 
at the critical point (K in Fig. 17) they coincide. The diagram in the coordi- 
nates P, V has a similar form. 


Fic. 16 Fic. 17 


When there is a critical point, a continuous transition can be effected 
between any two states of the substance without its ever separating into two 
phases. To achieve this, the state must be varied along a curve which passes 
round the critical point and nowhere intersects the equilibrium curve. In this 
sense, when there is a critical point, the concept of different phases is itself 
arbitrary, and it is not possible to say in every case which states have one 
phase and which have the other. Strictly speaking, there can be said to be two 
phases only when they exist simultaneously and in contact—that is, at points 
lying on the equilibrium curve. 

It is clear that the critical point can exist only for phases such that the 
difference between them is purely quantitative, for example a liquid and a gas 
differing only in the degree of interaction between the molecules. 

On the other hand, such phases as a liquid and a solid (crystal), or different 
crystal modifications of a substance, are qualitatively different, since they 
have different internal symmetry. It is clear that we can say only that a partic- 
ular symmetry property (symmetry element) exists or does not exist; it can 
appear or disappear only as a whole, not gradually. In each state the body 
will have one symmetry or the other, and so we can always say to which of 
the two phases it belongs. The critical point therefore cannot exist for such 
phases, and the equilibrium curve must either go to infinity or terminate by 
intersecting the equilibrium curves of other phases. 

An ordinary phase transition point is not a mathematical singularity of the 
thermodynamic quantities of the substance. For each of the phases can exist 
(though in a metastable state) beyond the transition point; the thermo- 
dynamic inequalities are not violated at that point. At the transition point the 
chemical potentials of the two phases are equal: u1(P, T) = p2(P, T); but this 
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point has no special property with respect to either one of the functions 
t(P, T) and uo(P, T).* 

Let us plotin the PV-plane an isotherm of the liquid and gas, i.e. the curve of 
P as a function of V in an isothermal expansion of a homogeneous body (abc 
and def in Fig. 18). According to the thermodynamic inequality (OP/oV )r < 0, 


P is a decreasing function of V. This slope of the isotherms must continue 
for some distance beyond their intersections (b and e) with the liquid— 
gas equilibrium curve; the segments bc and ed of the isotherms corre- 
spond to metastable superheated liquid and supercooled vapour, in which 
the thermodynamic inequalities are still satisfied. (A complete-equi- 
librium isothermal change of state between the points b and e corresponds, 
of course, to the horizontal segment be, on which separation into two phases 
occurs.) If we use the fact that the points b and e have the same ordinate P, 
it is clear that the two parts of the isotherm cannot pass continuously into 
each other; there must be a discontinuity between them. The isotherms 
terminate at points (c and d) where the thermodynamic inequality ceases to 
hold, i.e. 

(OP/eV)r = 0. (83.1) 
By constructing the locus of the points of termination of the isotherms of the 
liquid and gas, we obtain a curve AKB on which the thermodynamic inequali- 
ties are violated (for a homogeneous body), and which is the boundary of a 
region in which the body can never exist in a homogeneous state. The regions 


t It must be noted, however, that there is some degree of arbitrariness in these 
statements, due to an indeterminateness of u(P, T) in the region of metastability. 
The metastable state is one of partial equilibrium, having a certain relaxation 
time, in this case for the process of formation of nuclei of a new phase (see 
§ 162). The thermodynamic functions in such a state can therefore be defined only 
without taking account of these processes, and they cannot be regarded as the 
analytic continuation of the functions from the region of stability corresponding 
to the complete equilibrium states of the substance. 
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between this curve and the phase equilibrium curve correspond to super- 
heated liquid and supercooled vapour.’ It is evident that at the critical point 
the two curves must touch. 

Of the points lying on the curve AKB itself, only the critical point K 
corresponds to an actually existing state of the homogeneous body; this is 
the only point where the curve reaches the region of stable homogeneous 
states. 

It is worth mentioning that the condition (83.1) at the critical point can 
also be derived from the following simple considerations. Near the critical 
point, the specific volumes of the liquid and the vapour are almost the same; 
denoting them by V and V +6V, we can write the condition for equal pres- 
sures of the two phases as 


P(V, T) = P(V +6V, T). (83.2) 


Expanding the right-hand side in powers of dV and dividing by the small but 
finite quantity 6V, we have 


oP 1 o2P 
(57), +3 ov (az) + = 0. (83.3) 


Hence we see that, when 6V tends to zero, i.e. at the critical point, (@P/OV), 
must tend to zero. 

In contrast to the ordinary phase-equilibrium points, the critical point is a 
mathematical singularity of the thermodynamic functions of the substance 
(and the same applies to the whole curve AKB which bounds the region 
where homogeneous states of the body exist). The nature of this singularity 
and the behaviour of matter near the critical point will be discussed in 
§ 153. 


§ 84. The law of corresponding states 
Van der Waals’ interpolation formula for the equation of state, 


NT Na 


=p (84.1) 


P 


t The segment of the isotherm that corresponds to superheated liquid (bc in 
Fig. 18) may lie partly below the abscissa axis. Thus a superheated liquid may have 
a negative pressure. Such a liquid exerts an inward force on its boundary surface. 
Thus the pressure is not necessarily positive, and there can exist in Nature states 
(though only metastable ones) of a body with negative pressures, as already men- 
tioned in § 12. 
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is in qualitative agreement with the properties of the liquid— vapour transition 
which have been described in the preceding sections. The isotherms deter- 
mined by this equation are shown in Fig. 19. Curves passing above the crit- 
ical point K represent monotonically decreasing functions P(V) for T > T,. 
The isotherm that passes through the critical point has an inflexion there. 
At temperatures T < T,, each isotherm has a minimum and a maximum, 
and between them is a section with (OP/OV), > 0; these sections (shown by 
the broken lines in Fig. 19) do not correspond to any homogeneous states 
of matter that actually exist in Nature. 


As has been explained in § 83, a straight horizontal segment intersecting 
the isotherm corresponds to the equilibrium liquid-gas transition. The level 
at which this segment must be drawn is determined by the phase equilibrium 
condition #1 = fz, which we write as 


2 
fdy = 0, 
i 


the integral being taken along the path of the transition from a state of one 
phase to a state of the other phase. Integrating along the isotherm, we have 
du = v dP, and hence 


f V dP = 0. (84.2) 
1 


Geometrically, this condition signifies that the areas shown hatched in 
Fig. 19 for one isotherm are equal (Maxwell’s rule). 

The critical temperature, critical pressure and critical volume can be 
expressed in terms of the parameters a and b which appear in van der Waals’ 
equation. To do this, we differentiate (84.1) and put 
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which determine the point of inflexion on the isotherm. These equations 
with (84.1) give 

Te =a V. = 3Nb, Pea (84.3) 
We now use the reduced temperature, pressure and volume, 

Terr: P= biPs. VHV Ve. (84.4) 
In terms of these quantities, van der Waals’ equation becomes 


3 
ye 


(P+ Jer'-0 = 87". (84.5) 
It contains only V’, P’ and T’, and not quantities pertaining to a given 
substance. Equation (84.5) is therefore the equation of state for all bodies to 
which van der Waals’ equation is applicable. The states of two bodies for 
which their values of 7’, P’, V’ are equal are called corresponding states 
(clearly the critical states of all bodies are corresponding states). It follows 
from (84.5) that, if two bodies have equal values of two of the three quanti- 
ties T’, P’, V’, then the values of the third quantity are also equal, i.e. they 
are in corresponding states (the law of corresponding states). 

The “reduced” isotherms P’ = P’(V’) given by equation (84.5) are the 
same for all substances. The positions of the straight segments which give the 
liquid-gas transition points are therefore also the same. We can therefore 
conclude that for equal reduced temperatures the following quantities must 
be the same for all substances: (1) the reduced saturated vapour pressure, (2) 
the reduced specific volume of the saturated vapour, (3) the reduced specific 
volume of the liquid in equilibrium with the saturated vapour. 

The law of corresponding states can also be applied to the heat of transi- 
tion from the liquid to the gaseous state. Here the “reduced heat of evapora- 
tion” must be represented by a dimensionless quantity, g/7,. We can there- 


fore write’ 
q/T. = f(T/T,). (84.6) 


In conclusion, we may note that the law of corresponding states does not 
apply only to van der Waals’ equation. The parameters of a specific sub- 
stance disappear when reduced quantities are used in any equation of state 
containing only two such parameters. The law of corresponding states, taken 
as a general theorem not pertaining to any specific form of the equation of 
State, is somewhat more accurate than van der Waals’ equation, but its 
applicability is in general very restricted. 


* At temperatures considerably below the critical temperature, the ratio q/T, 
1S approximately 10 (where q is the molecular heat of evaporation). 


CHAPTER Ix 


SOLUTIONS 


§ 85. Systems containing different particles 


HITHERTO we have considered only bodies consisting of identical particles. 
Let us now go on to discuss systems which contain different particles. These 
include all kinds of mixtures of more than one substance; if the mixture con- 
tains much more of one substance than of the others, it is called a solution 
of the other substances in the predominant substance (the solvent). 

The number of independent components of the system customarily sig- 
nifies the number of substances whose quantities in a state of complete equi- 
librium can be specified arbitrarily. All the thermodynamic quantities for a 
system in complete equilibrium are entirely determined, for example, by the 
temperature, the pressure and the numbers of particles of the independent 
components. The number of independent components may not be the same 
as the total number of different substances in the system if a chemical reac- 
tion can occur between the latter; if such a system is in partial equilibrium 
only, the determination of its thermodynamic quantities requires, in general, 
a knowledge of the amounts of all the substances present in it. 

It is easy to generalise the results of § 24 to bodies consisting of different 
substances. Firstly, all the thermodynamic quantities must be homogeneous 
functions of the first order in all the additive variables—the numbers of the 
different particles and the volume. 

Next, instead of the concept of a single chemical potential of the body as 
the derivative of some one of its thermodynamic potentials with respect 
to the number of particles (§ 24), there are chemical potentials u, for each 
component of the mixture, which are the derivatives of the thermodynamic 
potential with respect to the numbers N, of particles of the components. 
Accordingly, the sum 2,dN, must now replace » dN in all the formulae 
(24.5) and (24.7)-(24.9). 

For example, the expression for the differential d® becomes 


and the chemical potential is 
4; = (OD/ON))p, r. (85.1) 
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The chemical potentials are then expressed as functions of the pressure, the 
temperature and the concentrations, i.e. the ratios of the numbers of particles 
of the different substances. These numbers of particles can appear in y, only 
as ratios, since ® is a homogeneous function of the first order in the N,, and 
the chemical potentials must therefore be homogeneous functions of zero 
order in these variables. 

From the fact that ® is a homogeneous function of the first order in the 
N,, we have, using Euler’s theorem, 


D = PN, O0/EN, = Das, (85.2) 


which is a generalisation of the formula ® = Nu. 
For the potential 2 we now have 


Q = F—Lu,N; 


and hence again 2 = —PV. The last formula ceases to be valid only for 
bodies in an external field, when the pressure in different parts of the bodies 
is different. 

The results of § 25 can also be generalised immediately: the conditions of 
equilibrium for a system in an external field require the temperature and also 
the chemical potential of each component to be constant throughout the 


system: 
/4; = Constant. (85.3) 


Finally, the Gibbs distribution for systems consisting of different particles 
becomes 


WnN,Na... = XD a aa (85.4) 


an obvious generalisation of formula (35.2). 


§ 86. The phase rule 


Let us now consider a system consisting of different substances and com- 
prising r phases in contact (each phase containing, in general, all the sub- 
stances). 

Let the number of independent components in the system be n. Then each 
phase is described by its pressure, temperature and n chemical potentials. 
We have seen in § 81 that the condition for equilibrium of phases consisting 
of identical particles is that temperature, pressure and chemical potential 
should be equal. It is evident that, in the general case of more than one com- 
ponent, the phase equilibrium condition will be that the temperature, pres- 
sure and each chemical potential are equal. Let T and P be the common 
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temperature and pressure of the phases. In order to distinguish the chemical 
potentials belonging to different phases and components we shall write them 
with a roman index for the phase and an arabic suffix for the component. 
Then the phase equilibrium conditions may be written 


w= w=... = wh 
Wa = We =... = MB, (86.1) 
Uh = Uh =... = Un. 


Each of these potentials is a function of n+1 independent variables: P, T, 
and n—1 concentrations of different components in the phase concerned 
(each phase contains n independent numbers of partides of different kinds, 
giving n— 1 independent ratios). 

The conditions (86.1) form a set of n(r—1) equations. The number of un- 
knowns is 2 +r(n—1). If these equations have solutions, the number of equa- 
tions must certainly not be greater than the number of unknowns, i.e. 
n(r—1) «< 2+7r(n—1), or 

ren+2., (86.2) 
In other words, in a system consisting of m independent components, not 
more than n +2 phases can be in equilibrium simultaneously. This is called 
Gibbs’ phase rule. We have seen a particular case of it in§ 81: when there is 
one component, the number of phases that exist in contact at one time 
cannot exceed three. 

If the number r of coexisting phases is less than n +2, n+2—r of the vari- 
ables in equations (86.1) can obviously take arbitrary values. That is, we can 
arbitrarily vary any n+2—r variables without destroying the equilibrium; 
the other variables must, of course, be varied in a definite manner. The num- 
ber of variables which can be arbitrarily varied without destroying the equi- 
librium is called the number of thermodynamic degrees of freedom of the sys- 
tem. If this is denoted by f, the phase rule may be written 

f=n+4+2-r, (86.3) 
where f cannot, of course, be less than zero. If the number of phases has its 
maximum possible value n +2, then f = 0, i.e. all the variables in equations 
(86.1) have definite values, and none of them can be varied without destroy- 
ing the equilibrium and causing one of the phases to disappear. 


§ 87. Weak solutions 


We shall now consider (in §§ 87-91) the thermodynamic properties of weak 
solutions, i.e. those in which the number of molecules of the dissolved sub- 
stances (the solutes) is much less than the number of solvent molecules. Let 
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us first take the case of a solution with only one solute; the generalisation to 
a solution with more than one solute is immediate. 

Let N be the number of solvent molecules in the solution, and m the num- 
ber of solute molecules. The ratio c = n/N 1s the concentration of the solu- 
tion, and from the above hypothesis c« 1}. 

Let us derive an expression for the thermodynamic potential of the solu- 
tion. Let oP, T, N) be the thermodynamic potential of the pure solvent 
(containing no solute). According to the formula ® = Nu (which is valid 
for pure substances) it can be written ®) = Nuo(P, T), where po(P, T) 
is the chemical potential of the pure solvent. Let « = a(P, T, N) denote the 
small change which would occur in the thermodynamic potential if one 
molecule of solute were added to the solvent. Since the solution is assumed 
weak, the solute molecules in it are comparatively far apart, and their inter- 
action is therefore weak. Neglecting this interaction, we can then say that 
the change in the thermodynamic potential! when n molecules are added 
to the solvent is nx. But the expression My) -+n« thus obtained fails to take 
account of the fact that all the molecules of the solute are identical. This is 
the expression which would be obtained from formula (31.5) if all the solute 
particles were regarded as different in calculating the partition function. 
As we know (cf. 31.7)), the partition function thus calculated must in fact 
be divided by n!.! 

This leads to an additional term T log n! in the free energy, and therefore 
in the potential ®. Thus 


@D = Nudol P, T)+na(P, T, N)+T log n!. 


Next, since 7 is itself a very large number, though small in comparison 
with N, we can write log n! = n log (n/e) in the last term. Then 


® = Nuotni{a+T log-(n/e)} 
= Nyuo+n¥ log [(n/e)e“/7]. 


We now take into consideration the fact that ® must be a homogeneous 
function of the first order in m and N. For this to be so it is clearly necessary 
that e*7 in the argument of the logarithm should be of the form f(P, TYN. 
Thus 

D = No +n log [(n/eN) f(P, T)). 


Defining a new function y(P, T) = T log f(P, T), we finally have for the 
thermodynamic potential of the solution the expression 


@ =: Nuo( P,T)+nT log (n/eN) +nyp(P, T). (87.1) 


t Here we neglect quantum effects, which is always permissible for a weak solu- 
tion, as it is for a sufficiently rarefied gas. 
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The assumption, made at the beginning of this section, about the addition 
of a term of the form ma to the potential of the pure solvent amounts essen- 
tially to an expansion in powers of n, retaining only the leading terms. The 
term of the next order in 7 is proportional to n?, and by the homogeneity 
in the variables n and N it must be n?8(P, T)/2N, where # is a function of P 
and T only. Thus the thermodynamic potential of a weak solution as far as 
the second-order terms is 


® = Nuo( P, T)+nT log (n/eN) + ny(P, T)+(n?/2N) B(P,T). (87.2) 


The generalisation of this to the case of a solution of more than one 
substance will obviously be 


® = Naot 5, mT log (m/eN)+¥ may +¥ (mme(2N)Bx, (87.3) 


where the n, are the numbers of molecules of the various solutes. 
From (87.1) we can easily find the chemical potentials for the solvent (1) 
and the solute (u’): the former is 


ft = OO/ON = po—Tn/N = po—Te, (87.4) 
and the latter is 
’ = 0O/On = T log (n/N) +y = Tlogcety. (87.5) 


§ 88. Osmotic pressure 


In this and the following sections we shall discuss some properties of solu- 
tions, again assuming them weak and therefore using the results of § 87. 

Let us suppose that two solutions of the same substance in the same sol- 
vent but with different concentrations c, and cg are separated by a partition 
through which solvent molecules can pass but solute molecules cannot (a 
semi-permeable membrane). The pressures on the two sides of the membrane 
will then be different; the argument in§ 12 to prove the equality of pressures 
is invalid here, because of the presence of the semi-permeable membrane. 
The difference between the pressures is called the osmotic pressure. 

The condition of equilibrium between the two solutions is (apart from the 
equality of their temperatures) that the chemical potentials of the solvent in 
them should be equal. The chemical potentials of the solute need not be the 
same, since the semi-permeability of the membrane means that there is equi- 
librium only with respect to the solvent. 

Denoting the pressures in the two solutions by P, and Pz, and using the 
expression (87.4), we obtain the equilibrium condition in the form 


Uo( Pi, T)~—exT = fo( Pe, T)— Col. (88.1) 
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The pressure difference P2—P, = AP (i.e. the osmotic pressure) is rela- 
tively small for weak solutions, and so we can expand jo(Pa, T’) in powers of 
AP, retaining only the first two terms: 


Ho( Pe, T) = Bo( Pi, T) +AP-Opo/OP. 
Substitution in (88.1) gives 
AP-Opo/OP = (co—evT. 
But O40/OP is just the molecular volume v of the pure solvent. Thus 
AP = (ce—c1)T/v. (88.2) 


In particular, if there is pure solvent on one side of the membrane 
(c, = 0, cz = c), the osmotic pressure is 


AP = Tv = nTIV, (88.3) 


where n is the number of solute molecules in a volume V of solvent; since 
the solution is weak, V is almost exactly equal to the total volume of the 
solution. Formula (88.3) is called van’t Hoff’s formula. It should be pointed 
out that this formula is applicable to weak solutions independently of the 
particular solvent and solute concerned, and that it resembles the equation 
of state of an ideal gas. The gas pressure is replaced by the osmotic pressure, 
the gas volume by the solution volume, and the number of particles in the 
gas by the number of molecules of solute. 


§ 89. Solvent phases in contact 


Let us consider the equilibrium of two solvent phases in contact, with a 
certain amount of the same substance dissolved in each. The equilibrium 
conditions are (apart from the equality of pressures and temperatures) the 
equality of the chemical potentials of the solvent and those of the solute in 
the two phases. Here we shall use the first condition, writing it in the form 


ACP, T)—q = ph(P, T)—cpT, (89.1) 


where ¢,, Cy afe the concentrations and pl, yo! the chemical potentials of 
the two phases of the pure solvent. 

It must be noted that the system considered here, consisting of two compo- 
nents and two phases, has two thermodynamic degrees of freedom. Of the 
four quantities P, T, cy, Cy, therefore, only two may be chosen arbitrarily; if 
we choose P or T and one of the concentrations, for example, then the other 
concentration has a definite value. 
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If the two solvent phases contained no solute, the condition for their equi- 


librium would be 
ua P, or Li o) = La P. 0° Ty), (89.2) 


the temperature and pressure of both phases being denoted by To and Pp. 

Thus, whereas in the equilibrium of pure solvent phases the relation be- 
tween pressure and temperature is given by equation (89.2), when any sub- 
stance is dissolved in these phases the relation is given by equation (89.1). 
For weak solutions the two SaueHons are not greatly different. 

Let us now expand ui(P, T) and ui(P, T) in equation (89.1) in powers of 
P—Po = AP and T—To = AT, where Poand To are the pressure and temper- 
ature at some point on the equilibrium curve of the pure solvent phases 
close to a given point P, T on the equilibrium curve of the solution phases. 
Retaining in the expansion only the first-order terms in AP and AT, and 
using (89.2), we have from (89.1) 


ees 7 oT +sp ol AP— ct = ett AT + 2 AP— cyl. 


But —Oo/OT and Oyo/OP are just the entropy s and the volume v of the 
pure solvent (per molecule). Adding the suffix denoting the phase, we have 


_ (s1— Sq) AT + (v1 _ vy) AP = (c _ Cy)T. (89.3) 


According to formula (81.5), we have (s,;—s)T = q, where q is the latent 
heat of transition of the solvent from phase I to phase IT. Thus (89.3) may 


be written 
(9/T) AT + (u— vn) AP = (cy T. (89.4) 


Let us examine two particular cases of this formula. We first choose the 
point Po, To such that Py = P. Then AT will be the horizontal distance be- 
tween the two curves, i.e. the change in the temperature of transition be- 
tween the two phases when the solute is added, or the difference between the 
transition temperature T (at pressure P) when both phases are solutions 
and the transition temperature T» (at the same pressure) for the pure sol- 
vent. Since AP = 0 here, (89.4) gives 


AT = T*(c;—cn)/q. (89.5) 

If one of the phases (I, say) is the pure solvent (c,; = 0, ¢ = c), then 
AT = T°c/q. (89.6) 
This formula determines, in particular, the change in the freezing point when 
the solute is added, if the solute is insoluble in the solid phase; the two phases 


are then the liquid solution and the solid solvent, and AT is the difference 
between the temperature at which the solvent freezes out of the solution and 
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that at which the pure solvent freezes. On freezing, heat is liberated, i.e. g 
is negative. Hence AT < 0 also; i.e. if the pure solvent freezes out, the addi- 
tion of solute lowers the freezing point. 

The relation (89.6) also determines the change in the boiling point when 
the solute is added, if the solute is not volatile; the two phases are then the 
liquid solution and the solvent vapour, and AT is the difference between the 
temperature at which the solvent boils off from the solution and that at which 
the pure solvent boils. Since heat is absorbed in boiling, g > 0 and therefore 
AT > 0, i.e. the boiling point is raised by the addition of the solute. 

All these consequences of formula (89.6) are fully in accordance with Le 
Chatelier’s principle. For example, let a liquid solution be in equilibrium 
with the solid solvent. If the concentration of the solution is increased, then 
by Le Chatelier’s principle the freezing point must be lowered so that part 
of the solid solvent is added to the solution and the concentration 1s thereby 
lowered. The system as it were counteracts its disturbance from the equi- 
librium state. Similarly, if the concentration of the liquid solution in equi- 
librium with the solvent vapour is increased, the boiling point must be raised 
so that part of the vapour condenses into the solution and the concentra- 
tion is lowered. 

Let us now consider another particular case of formula (89.4), choosing 
the point Po, T> so that T = To. Then AP is the vertical distance between 
the two curves, i.e. the difference between the pressure of the two solution 
phases in equilibrium and that of the two pure solvent phases in equilibrium 
(at the same temperature). Here AT = 0, and from (89. 4) we have 


AP = T(q1—cn)|(v1— 1) (89.7) 


Let us apply formula (89.7) to an equilibrium between liquid and gaseous 
phases. Then the volume of one phase (the liquid) may be neglected in 
comparison with that of the other, and (89.7) becomes 


AP = T(c1—cu)/2, (89.8) 


where v is the molecular volume of the gas phase (I). Noting that Pu = T, 
and substituting to the same accuracy P x Pp (where Po is the saturated 
vapour pressure over the pure solvent), we can write this formula as 


AP = Po(cr << cu): (89.9) 


If the gas phase is the pure solvent vapour (c, = 0, ¢y = c), then (89.9) 
becomes 
AP/P) =—c, (89.10) 


where c is the concentration of the solution. This formula gives the difference 
between the saturated vapour pressure of the solvent over the solution (P) 
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and over the pure solvent (Po). The relative decrease in the saturated vapour 
pressure when the solute is added is equal to the concentration of the solu-. 
tion (Raoult’s law).* 


§ 90. Equilibrium with respect to the solute 


Let us now consider a system consisting of two solutions in contact, the 
solutions being of the same substance in different solvents (for instance, in 
two immiscible liquids), and their concentrations being denoted by c: 
and Ce. 

The equilibrium condition for this system is that the chemical potentials 
of the solute in the two solutions should be equal. Using (87.5), we can write 
this condition in the form 


T log c1+yi(p, T) = T log co+po(p, T). 


The functions p; and ype are, of course, different for the different solvents. 


Hence we find 
C1/Co = eve—vd/T, (90.1) 


The right-hand side of this equation is a function of P and T only. Thus the 
solute is distributed between the solvents in such a way that the ratio of 
concentrations is always the same (for given pressure and temperature), 
independently of the total quantities of the solute and solvents (the distri- 
bution law). The same law obviously applies to a solution of one substance 
in two adjacent phases of the same solvent. 

Now let us consider the equilibrium between a gas (assumed ideal) and a 
solution of it in a liquid or solid solvent. The equilibrium condition, i.e. the 
equality of the chemical potentials of the pure gas and the dissolved gas, can 
be written (using (42.6) and (87.5)) in the form 


T logce+y(P, T) = T log P+ (7), 
whence 
C= Pes-vVT, (90.2) 


The function p(P, T) describes the properties of the liquid (or solid) solu- 
tion. At low pressures, the properties of a liquid depend only very slightly on 
the pressure. Hence the dependence of p(P, T) on the pressure is unimportant, 
and we can suppose that the coefficient of P in (90.2) is a constant independ- 


ent of the pressure: 
c = PXconstant. (90.3) 


t It will be remembered that ¢ denotes the molecular concentration (ratio of 
numbers of molecules, 2/N). 
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Thus, when a gas dissolves, the concentration of the (weak) solution is pro- 
portional to the gas pressure (Henry’s law).* 


PROBLEMS 


PROBLEM 1. Find the variation of concentration with height for a solution in a 
gravitational field. 


SOLUTION. We apply the equilibrium condition (85.3) in an external field, 
writing it for the solute: T log e+(P, T)+mgz = constant, since the potential 
energy of a solute molecule in the gravitational field is mgz (z being the height, 
and m the mass of the molecule). We differentiate this equation with respect to z, 
noting that the temperature is constant by one of the conditions of equilibrium: 


Since the volume of the solution is 
o®@ 


EA Hoy 5 OP. 
oP 


“op +" op 
(substituting for ® the expression (87.1)), the quantity Op/OP may be called the vol- 
ume v’ per molecule of solute. Hence 


dP 
7 Pea faa a 0. 


In order to find P as a function of z, we use the equilibrium condition for the 


solvent:+ 
v dP/dz+ Mg = 0, 


where v = O)/OP is the molecular volume and M the mass of a solvent molecule. 
Substituting dP/dz in the previous condition, we find 


T d 
— < 4+. mg—Me —- = 0. 
c dz 7) 


If the solution may be regarded as incompressible, i.e. v and v’ are constants, 


this gives 
c= cge7 #/T) (m—v’M/»v) ; 


where ¢p is the concentration of the solution when z = 0, i.e. the usual barometric 
formula corrected in accordance with Archimedes’ principle. 


t It is assumed that the molecules dissolve unchanged. If they dissociate (as in 
the dissolution of hydrogen H, in certain metals), the dependence of the concentra- 
tion on the pressure is different; see § 102, Problem 3. 

t In this condition the term involving the concentration (—T de/dz) is small 
and may be omitted; in the condition for the solute, it contained ¢ in the denomi- 
nator and was therefore not small. 
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PROBLEM 2. Find the relation between the changes in the solubilities of two sub- 
stances simultaneously dissolved in the same solvent.t 


SOLUTION. The interaction between the two solutes is taken into account by 
the quadratic term (proportional to n,n.) in the thermodynamic potential (87.3). 
The chemical potentials of the solutes are 


My = OD/On, = T log ¢, +9, +81 +P 2 


and similarly for wz (the concentrations are c, = n/N, C, = mg/N). The sol.- 
bilities cg, and cog of each substance in the absence of the other are given by t c 
equilibrium conditions. 


Boy = Tlog eg +4, +¢o81» 


Loz = T log coo t+Y2+CooPes » 


where 4g, and jfigg are the chemical potentials of the pure solutes. The joint 
solubilities cy, and co, aregiven by 


Lo, = T log Cort Prt CoP 11+ Co2h ia» 
Mog = T log cq9+2+Co2Pa2+ C0112 


Subtracting (1) from (2) term by term, and using the relative smallness of the chan- 
ges in the solubilities (cg, = ¢91—Co1 “« Coy» S€o2 <K Cog), we find 


T 6eq,/ C1 = —coohr2» T dco2/ Con = —eqPy2 . 


tt} 


(2) 


Hence 
8eo, = Seq2, 


ie. the changes in the solubilities of the two substances are equal. 


PROBLEM 3. Find the relation between the changes of saturated vapour pressure 
of two solutes when both are present. 


SOLUTION. The saturated vapour pressures above solutions of each substance 
separately are given by the equilibrium conditions 


T log Py +44(7) = T log e+y, +81, 
T log Po+y(T) = T log co+Wet+CoPoo ; 


the expressions on the left-hand side are the chemical potentials of the two sub- 
stances inthe vapour. The pressures P; and P, above the solution of both solutes 
are given by 

Tlog Pi +4, = Tlog ce +yteBytePis, 


T log Po+4%o = T log co+ Wet CoB 22+cBi2- 
Hence, if the changes dP, = P;—P, and OP, are small, we have 
T OP, /P, = CoBj2, T OP2/Pp = cyBi9. 
and the required relation 


¥ The solubility is the concentration of a saturated solution. It is assumed that 
this concentration is still so small that the formulae of the theory of weak solu- 
tions are applicable. 
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§ 91. Evolution of heat and change of volume on dissolution. 


The process of dissolution is accompanied by the evolution or absorption 
of heat. Let us now calculate the quantity of heat involved, and first deter- 
mine the maximum work which can be done as a result of the dissolution 
process. 

Let us suppose that the dissolution occurs at constant pressure and tem- 
perature. In this case the maximum work is determined by the change in the 
thermodynamic potential. Let us calculate it for a process in which a small 
number 6n of solute molecules are dissolved in a solution already of concen- 
tration c. The change 5@ in the total thermodynamic potential of the system 
is equal to the sum of the changes in the potentials of the solution and the 
pure solute. Since én molecules of solute are added to the solution, the change 
in its thermodynamic potential is 


OD,o1 


ODay = an 


én = p’ On, 


where ,’ is the chemical potential of the solute in the solution. The change in 
the potential ®, of the pure solute is 
oD, 


6D, = — an bn = — Ug On, 


since the number of molecules of it decreases by dn, 4g being the chemical 
potential of the pure solute. The total change in the thermodynamic potential 
in this process is therefore 


6D = On’ — Uy). (91.1) 
We now substitute y’ from (87.5): 
6® = —T dn log a s (91.2) 
where 
CoP, T) = eo-vT (91.3) 


is the solubility, i.e. the concentration of a saturated solution (that is, one 
which is in equilibrium with the pure solute). This is clear from the fact 
that in equilibrium ® must have a minimum, i.e. we must have 6® = 0. 
Formula (91.3) can also be derived directly from the condition for equilibri- 
um between the solution and the pure solute, i.e. from the equality of the 
chemical potentials of the pure solute and that in the solution. Note, 
however, that co may be identified with the concentration of the saturated 
solution only if co is small, since all the formulae in the last few sections are 
applicable only to small concentrations. 
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The expression obtained gives the required quantity of work: | 6®| is the 
maximum work which can be done by the dissolution of 6n molecules, and is 
also the minimum work which is needed to separate dn molecules of solute 
from a solution of concentration c. 

There is now no difficulty in calculating the heat 6Q, absorbed in dissolu- 
tion at constant pressure (if 50, < 0, this means that heat is evolved). The 
quantity of heat absorbed in a process which occurs at constant pressure is 
equal to the change in the heat function (§ 14). Since, on the other hand, 


we have! 


50> = Sl 7), (91.4) 


Substituting the expression (91.2) in this formula, we find the required quan- 
tity of heat: 


6Op = T? bn 0 log co/OT. (91.5) 


Thus the quantity of heat involved in the dissolution process is related to the 
temperature dependence of the solubility. We see that dQ, is simply propor- 
tional to én; this formula is therefore applicable also to the dissolution of any 
finite quantity of substance (so long as the solution remains weak, of course). 
The quantity of heat absorbed in the dissolution of n molecules is 


Op = Tn 0 log co/OT. (91.6) 


We may also determine the change in volume on dissolution, i.e. the differ- 
ence between the volume of the solution and the sum of the volumes of the 
pure solute and the solvent in which it is dissolved. Let us calculate this 
change dV in the dissolution of dn molecules. The volume is the derivative 
of the thermodynamic potential with respect to the pressure. The change in 
volume is therefore equal to the derivative, with respect to pressure, of the 
change in the thermodynamic potential: 


bv =< 80. (91.7) 


t The corresponding formula for the quantity of heat in a process which occurs 
at constant volume is 


(91.4a) 
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Substituting 6® from (91.2), we find 


0 
6V =-T On ae log Co. (91.8) 


In conclusion, it may be noted that formula (91.6) is in accordance with 
Le Chatelier’s principle. Let us suppose, for example, that Qp is negative, 
i.e. that heat is evolved on dissolution, and let us consider a saturated solu- 
tion. If this is cooled, then by Le Chatelier’s principle the solubility must 
increase so that more dissolution occurs. Heat is then evolved, i.e. the system 
as it were counteracts the cooling which disturbs its equilibrium. The same 
follows from (91.6), since in this case Oco/OT is negative. Similar arguments 
show that formula (91.8) is also in accordance with Le Chatelier’s principle. 


PROBLEMS 


PROBLEM 1. Find the maximum work that can be done in the formation of a 
saturated solution. 


SOLUTION. Before dissolution, the thermodynamic potential of the pure solvent 
was Nip, and that of the pure solute ng. The potential of the whole system was 
®, = Nuot+npy. After dissolution, the thermodynamic potential ®, = Nuio+ 
nT log (n/eN)+-ny. The maximum work is 


Riss = ®, —®, 
= —nT log (n/eN)+n(ug—Y) 
= nT log (eco/c); 


this may also be derived by integration of (91.2). If a saturated solution is formed, 
ie. c = coandn= Ne = Nop, then. 


Rmax = nT = NeoT. 


PROBLEM 2, Find the minimum work which must be done to raise the concentra- 
tion of a solution from c, to c, by removing some of the solvent. 


SOLUTION. Before the removal, the thermodynamic potential of the solution was 
D, = NuotNe,T log (c,/e)+Neyp (the number of solute molecules was Ney, 
where N was the original number of solvent molecules). In order to raise the con- 
centration of the solution to cz, we must remove from it N(1~c,/c,) solvent 
molecules, The sum of the thermodynamic potentials of the remaining solution 
and the solvent removed gives ®, = Nig +Ne,T log (cg/e)+Neyp. The minimum 
work is 


Rmin = P2 —P, = Ne,T log (€2/c;). 
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§ 92. Solutions of strong electrolytes 


The method of expanding the thermodynamic quantities in powers of the 
concentration used in the preceding sections is completely inapplicable in the 
important case of solutions of strong electrolytes, that is, substances which 
dissociate almost completely into ions when dissolved. The slow decrease of 

.the Coulomb interaction forces between ions with increasing distance leads to 
terms proportional to a power of the concentration lower than the second 
(namely, the $ power). 

It is easy to see that the problem of determining the thermodynamic quan- 
tities of a weak solution of a strong electrolyte reduces to the problem of a 
completely ionised gas discussed in § 78 (P. Debye and E. Hiickel, 1923). 
This result may be derived by starting from the fundamental statistical for- 
mula (31.5) for the free energy. The integration in the partition function will 
be carried out in two stages, first integrating over the coordinates and mo- 
menta of the solvent molecules. Then the partition function becomes 


j e-F@. OT AI, 


where the integration is now taken only over the phase space of the electro- 
lyte particles, and F(p, q) is the free energy of the solvent with the ions 
“fixed” in it, the ion coordinates and momenta being regarded as parame- 
ters. We know from electrodynamics that the free energy of a system of 
charges in a medium (of given volume and temperature) can be deduced from 
the energy of the charges in empty space by dividing the products of each pair 
of charges by the dielectric constant « of the medium.’ The second step in 
calculating the free energy of the solution is therefore identical with the calcu- 
lations given in § 78. 

Thus the required contribution of the strong electrolyte to the free energy 
of the solution is given, according to (78.12), by 


~aae (rr) (ra) 


where the summation is over all the kinds of ion in the solution; in accord- 
ance with the notation used in this chapter, n, denotes the total number of 
ions of the ath kind (in the whole volume of the solution). The same expres- 
sion gives the contribution to the thermodynamic potential for given temper- 
ature and pressure. Putting V = Nv, where v(P, 7) is the molecular volume 


t This assumes that the distances between ions are large compared with mo- 
lecular dimensions, but we know from § 78 that in the approximation considered 
the main contribution to the thermodynamic quantities comes in fact from these 
distances. 
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of the solvent, we can write the thermodynamic potential of the solution in 
the form 
D = Nuot) nal log (na/eN)+ > napa 
a a 


“42 2\ 3/2 oe 
_ 2a (Fr) (AF) 2.1) 


323/2 N 


From this we can find, by the usual rules, any of the thermodynamic props 
erties of the electrolyte solution. For example, to calculate the osmotic pres- 
sure we write chemical ame of the solvent as 


1/2 7 Sy ze = 
ae Wore 36 = (ar) ( ne " - (92.2) 
As in § 88, we find from this a osniotic pressure (at an interface wate the 
pure solvent) 
n\ V2 (Snz2\ 3? Ei! 
~ VS ox 3a <.(%) ( V ) . oe 
The heat function of the solution is 
o ®@ 0 Wa 
se OE oe —_T2YN. pike 
‘oes Tar r),7 RT one” (Se 
2e3 (2 ' 0 1 _— 
toa (=) (An .ze0T? sp (arsasa)- 5 ES) 


From this we can find the heat of solution Q which is liberated when the 
solution is diluted (at constant P and T’) with a very large amount of solvent 
(so that the concentration tends to zero). This quantity of heat is given by the 
change in the heat function during the process. The terms linear in the num- 
ber of particles obviously give zero difference, and we find from (92.4) 


1/2 2. 3/2 
Q= = w (Fae) Ts ( caepaaae (92.5) 


N es/273/ 2a /2 


The only condition for the above formulae to be valid is that the concentra- 
tion should be sufficiently small. For the fact that the electrolyte is strong 
‘means that the energy of attraction between ions of different kinds is always 
less than T. Hence it follows that the interaction energy is certainly small 
compared with T at distances large compared with molecular distances. But 
the condition n « N for the solution to be weak means precisely that the 
mean distance between ions is large in comparison with molecular dimen- 
sions. Thus this condition neeeeTnY implies that the condition of weak 


inter action: 
nfV <« (eT/2e) 


(cf. (78.2)) is satisfied, and this is the basis of the approximations used in§ 78. 
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PROBLEM 


Find the change in the solubility (assumed small) of a strong electrolyte when a 
certain quantity of another electrolyte is added to the solution (all the ions of 
the second electrolyte being different from those of the first). 


SOLUTION. The solubility (i.e. the concentration of a saturated solution) of the 
strong electrolyte is given by the equation 


UP, T) = y Melg= T » Yq log (ng N)+ » VaWa 


ee 12 1/2 
“e(me) (RAE) 


Here p, is the chemical potential of the pure solid electrolyte, and y, the number 
of ions of the ath kind per molecule of the electrolyte. When other ions are added 
to the solution, the chemical potentials of the original ions are changed because 
of the change in the sum 2n,z?, which must include all ions present in the solution. 
Having defined the solubility cy by ng/N = v,Co, we find the change in it by varying 
the expression (1) for given P and T: 


7a'l%e3(Enyzp)¥l2 


Oo = Fah yiapaRNSRyy, M076). 


The sum following 6 includes only the added kinds of ion. It should be noted that 
the solubility is raised under the conditions assumed. 


§ 93. Mixtures of ideal gases 


The additivity of the thermodynamic quantities (such as energy and en- 
tropy) holds good only so long as the interaction between the various parts of 
a body is negligible. For a mixture of several substances, e.g. a mixture of 
several liquids, the thermodynamic quantities are therefore not equal to the 
sums of the thermodynamic quantities for the individual components of the 
mixture. 

An exception is formed by mixtures of ideal gases, since the interaction 
between their molecules is by definition negligible. For example, the entropy 
of such a mixture is equal to the sum of the entropies which each of the gases 
forming the mixture would have if the other gases were absent and the vol- 
ume of the one gas were equal to that of the mixture, its pressure therefore 
being equal to its partial pressure in the mixture. The partial pressure P, of 
the ith gas is expressed in terms of the pressure P of the whole mixture by 


P,= NI|V = NPIN, (93.1) 


where N is the total number of molecules in the mixuture, and N, the number 
of molecules of the ith gas. Hence, by (42.7), the entropy of a mixture of two 


280 Solutions 


gases is 
S = N, log (eV/Ny) +z log (CV/N2)—-Ny fi) —Nofe(T), (93.2) 


or, from (42.8), 


S = —N, log P,—N, log Pa—Niy(T)—Nox(T) 
= —(Ni+N2) log P—N, log (N1/N)—Nz2 log (N2/N) 
—Nixy(T)—Nox2(T). (93.3) 


The free energy of the mixture is, by (42.4), 
F = —N,T log (eV /N1)—NoT log (€V/N2)+Nifi(T)+Nofo(T), (93.4) 
and similarly (42.6) gives for the potential ® 


D = N,T log P, + NT log Py+Ny,(T) + Nore(T) 
= N,(T log P+ 4%) +N,(T log P+ 42) 
+ NiT log (Ni/N) + Naf log (N2/N). (93.5) 


This expression shows that the chemical potentials of the two gases in the 
mixture are 


My = Tlog Pi +4% = T log P-+4,+T log (N,/N), 


(93.6) 
Me = T log Py +42 = T log P+ ¥2+T log (N2/N), 


i.e. each has the same form as the chemical potential of a pure gas with pres- 
sure P; or Po. 
It may be noted that the free energy (93.4) of a mixture of gases has the 


form 
F = Fi(M,, V, T)+F2(No, V,; T), 


where F and F? are the free energies of the two gases as functions of the num- 
ber of particles, volume and temperature. No similar formula is valid for the 
thermodynamic potential, however: the potential ® of the mixture has the 
form 


@ = ©1(N1, P,; T)+®2(Ne, PF, T)+N,T log (Ni/N)+NeT log (N2/N). 


Let us suppose that we have two different gases with numbers of particles 
N, and Nein vessels of volumes V1 and V2 at the same temperature and pres- 
sure, the two vessels then being connected and the gases mixed. The volume 
of the mixture becomes V1 +V2, and the pressure and temperature obviously 
remain the same. The entropy, however, changes: before mixing, the entropy 
of the two gases is equal to the sum of their entropies, 


Sy = Nj log (eV,/N,) + Ne log (eV2/N.)—Ni fi (T)—Nofe (7), 
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while after mixing the entropy is, by (93.2), 

S = Ny logt[e(V, +V2)/Ny] +o log [e(V, + V2)/Nal -Nifi — Noha - 
The change in entropy is 


AS = S—So 
= Ni log (V1+V2)/Vi] + Na log [(V1+V2)/Va], 


or, since the volume is proportional to the number of particles for given pres- 
sure and temperature, 


AS = N1 log (N/N1)-+Na log (N/Na). (93.7) 


This quantity is positive, i.e. the entropy increases on mixing, as it should, 
because the process is clearly irreversible. The quantity 4S is called the 
entropy of mixing. 

If the two gases were identical, the entropy after connecting the vessels 
would be 


S = (Ni+Ne) log [((V1+V2)/(Ni+N2)|—(NitN2)/’, 


and, since (Vi +V2)/(N1+N2) = Vi1/N1 = Va/No (the pressures and tempera- 
tures being equal), the change in entropy would be zero. 

Thus the change in entropy on mixing is due to the difference in the mole- 
ctiles of the gases that are mixed. This is in accordance with the fact that 
some work must be done in order to separate again the molecules of the two 
gases. 


§ 94, Mixtures of isotopes 


A mixture of different isotopes (in any aggregate state) is a kind of “solu- 
tion”. For simplicity and definiteness we shall speak of a mixture of two iso- 
topes of any element, but the same results apply to a mixture of any number 
of isotopes and also to chemical compounds in which different molecules 
contain different isotopes. 

In classical mechanics, the difference between isotopes is simply a differ- 
ence in mass, the laws of interaction between atoms of different isotopes being 
identical. This enables us to express the thermodynamic quantities for the 
mixture very simply in terms of those for the pure isotopes. In calculating the 
partition function for the mixture, the essential difference is that the phase 
volume element should be divided not by MN! as for a pure substance but by 
the product Ni! Ne! of the factorials of the numbers of atoms of the two 
components of the system. This gives in the free energy the further terms 


NxT log (Ni/N) +NoT log (N2/N) 
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(where N = N,+N2), which correspond to the “entropy of mixing” dis- 
cussed in § 93 for the case of a mixture of gases. 

Similar terms appear in the thermodynamic potential of the mixture, which 
may be written 


® = NiT log (N1/N) + NaF log (No/N) +NitoitNoor. (94-1) 


Here soi and jog are the chemical potentials of the pure isotopes, which 
differ only by a constant times the temperature: 


Ho1— Hos = — 3T log (73/ma), (94.2) 


where m and mz are the atomic masses of the two isotopes. This difference 
arises from the integration over the atomic momenta in the partition func- 
tion; for gases, (94.2) is simply the difference between the chemical constants 
multiplied by 7. . 

The difference (94.2) is the same for all phases of a given substance. The 
equation of phase equilibrium (the condition that the chemical potentials of 
the phases are equal) is therefore the same for every isotope. In particular, we 
can say that in the classical approximation the saturated vapour pressures of 
the various isotopes are equal. 

The situation is no longer so simple when the substance cannot be de- 
scribed by means of classical statistics. In quantum theory, the difference be- 
tween isotopes becomes considerably more profound, because of the differ- 
ences in the vibrational and rotational levels, nuclear spins, etc. 

It is important to note, however, that, even when the first correction terms 
(of order A?; see § 33) in the thermodynamic quantities are taken into account, 
the thermodynamic potential of the mixture may be written in the form (94.]), 
since the terms in question form a sum, with each term containing the mass 
of only one atom (see formula (33.15) for the free energy). These terms may 
therefore be grouped so as to include them in the chemical potentials 401 and 
oz, and hence formula (94.1) (but not, of course, (94.2)) remains valid. 

It should be pointed out that the thermodynamic potential (94.1) is for- 
mally identical with that of a mixture of any two gases (§ 93). Mixtures having 
this property are called ideal mixtures. Thus mixtures of isotopes are ideal 
mixtures up to and including terms of order f?. In this sense, mixtures of iso- 
topes form an exceptional case, since condensed (solid or liquid) mixtures of 
different substances which are not isotopes can be ideal mixtures only to 
a very rough approximation. 

Within the limits of validity of formula (94.1) we can draw certain conclu- 
sions about the vapour pressure of the isotopes over the condensed mixture. 
The chemical potentials of the two components of the mixture are 


fi = T log c1+ bo, 
fa = T log c2+ fo2z 
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(where cy = Ny/N, ce = No/N are the concentrations of the isotopes). Equat- 
ing these to the chemical potentials in the gas phase (which have the forms 
T log Pi +41(T) and T log P2 +72(T)), we find for the partial vapour pressures 


Py = Poici, P2 = Poste, (94.3) 


where Po: and Po2 denote the vapour pressures of the two pure isotopes (at a 
given temperature). Thus the partial vapour pressures of the two isotopes are 
proportional to their concentrations in the condensed mixture. 

In the classical approximation we have for the saturated vapour pressures 
of the pure isotopes Po; = Poa, as already mentioned. When quantum effects 
are taken into account, however, the two vapour pressures are no longer 
equal. The difference cannot be calculated in a general form applicable to all 
substances. Such a calculation can be made only for monatomic elements 
(the inert gases) as far as the terms of order f? (K. F. Herzfeld and E. Teller, 
1938). 

The correction to the thermodynamic potential of a liquid phase is given 
by formula (33.15);' taking the value per atom, we find the chemical poten- 
tial 

[b= er + (f?/24mT)F?, 


where F? is the mean square of the force exerted on one atom by the other 
atoms in the liquid. The chemical potential of the gas remains equal to its 
classical value, since the interaction between atoms in the gas is negligible. 
Equating the chemical potentials of the liquid and the gas, we find the cor- 
rection to the classical value of the vapour pressure, and the required differ- 
ence of vapour pressures between the the two isotopes is 


WF (1 1 
ey ra) ia eae mi 94.4 

Poi—Po2z = Po ATA (3 ae ), (94.4) 
where Pp is the common classical value of Po: and Pog. We see that the sign 
of the difference is determined by that of the difference of the reciprocal mas- 
ses of the isotopes, the vapour pressure of the lighter isotope being the greater. 


§ 95. Vapour pressure over concentrated solutions 


Let us consider the equilibrium of a solution with the vapour over it, which 
in general also contains both substances. The solution may be either weak 
or strong, i.e. the quantities of the two substances in it are arbitrary. It will 
be remembered that the results derived in § 89 apply only to weak solutions. 


t We again make use of the fact that small increments to the various thermody- 
namic potentials, when expressed in terms of the corresponding variables, are 
equal (§ 15). 
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Since the solution and the vapour are in equilibrium, the chemical poten- 
tials 41 and pe in the solution and in the vapour are equal. If the numbers of 
particles of the two substances in the solution are N,, and N2,, we can write 
the expression (24.14) for the solution in the form 


df2 = —Nj, dui—Nos due—S, d7—P dV, 7 (95.1) 


where S, and V, are the entropy and volume of the solution; the temperature 
T and pressure P are the same for the solution and the vapour. 

We shall assume that the vapour over the solution is so rarefied that it may 
be regarded as an ideal gas; its pressure is small. Then we can neglect in (95. 1) 
the terms proportional to P, viz. P dV and dQ. Let us first consider all deriv- 
atives to be taken at constant temperature. Then (95.1) gives 


Nie dpi +Nos dus = 0. (95.2) 
For the gas phase we have 
Mag = T log P+ (7), 
Mag = T log Py +2(T); 
where P; and Ps are the partial pressures of the two components of the 
vapour. Differentiating these expressions (with 7’ constant), we find 
duig = TdlogPi, due = Td log Po. 


Substitution in (95.2) gives 
Ni, d log Py +Nez d log Pz = 0. (95.3) 


The concentration & of the solution can be defined as the ratio of the num- 
ber of particles of the first component to the total number of particles: 


E a Mi;/(Ni1s + Nas), 


and we can similarly define the concentration x of the vapour. The partial 
pressures P; and Pz are equal to the total pressure P of the vapour multiplied 
by the concentrations of the corresponding components, i.e. P1 = xP, 
P, = (1—x)P. Substituting these values in (95.3) and dividing this equation 
by the total number of particles in the solution, N = N,,+N.,, we find 


Ed log Px +(1—€) d log P(l—x) = 0, 
whence 
dlog P = (x—&) dx/x(1 —x), 
or 
E= x—x(1—x) 9 log P/dx. (95.4) 


This equation relates the solution and vapour concentrations to the depend- 
ence of the vapour pressure on the vapour concentration. 
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One further general relation can be obtained by considering the dependence 
of quantities on temperature. The condition for equality of the chemical po- 
tentials of one component, say the first, in the vapour and in the solution is 
th, = 0®,/N,,. Dividing both sides by T and using the fact that the deriva- 
tive with respect to the number of particles is taken at constant temperature, 
we write 

Mig = ) ®, 

“T OM, T’ 
and then take the total derivative of each side with respect to temperature. 
In doing so we may assume with sufficient accuracy that the thermodynamic 
potential of the condensed phase (the solution) is independent of pressure. 
Noting also that the partial derivative with respect to temperature is 


o ®@ o® W 
or Fa 7 (2 Tx eq 
we obtain the relation 
r Clog Py _ OW, (95.5) 


ar. OM, 


Here w,, is the molecular heat function of the first substance as a gas; the 
derivative OW,/ON,, gives the change in the heat function of the solution 
when one molecule of that substance is added to it. The quantity on the 
right of (95.5) is therefore the heat absorbed when one particle of the first 
substance goes from the solution to the vapour. 

For the first substance in the pure state, the relation (95.5) becomes the 
otdinary Clapeyron—Clausius equation, 


re] log Pio 


POT 


= Wig Wis 


where Po is the vapour pressure of the first substance in the pure state, and 
W,, its molecular heat function when liquid. Subtracting this equation term 
by term from (95.5), we have finally 


cs) 


Py 
T? al log Pio =—-qW1> (95.6) 


where qg, = OW,/ON,,—W,, denotes the molecular heat of dilution, i.e. the 
quantity of heat absorbed when one particle from the liquid first substance 
goes into the solution. A similar relation can, of course, be written down for 
the second substance also. 
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§ 96. Thermodynamic inequalities for solutions 


It has been shown in § 21 that a body can exist only in states for which 
certain conditions called thermodynamic inequalities are satisfied. These con- 
ditions were derived, however, for bodies consisting of identical particles. We 
shall now give a corresponding analysis for solutions, taking only the case of 
a mixture of two substances. 

In § 21 the condition of equilibrium used was not the condition of maxi- 
mum entropy of a closed system as a whole but the equivalent condition 
which requires that the minimum work needed to bring any small part of the 
system from the equilibrium state to any neighbouring state should be posi- 
tive. 

We now use a similar procedure, considering some small part of the solu- 
tion, which contains N solvent and 7 solute particles, say. In the equilibrium 
state the temperature, pressure and concentration in this small part are equal 
to their values in the rest of the solution (which acts as an external medium). 
Let us determine the minimum work needed to bring the temperature, pres- 
sure and number of solute particles in the small part considered (containing a 
fixed number N of solvent particles) to values which differ by small but 
finite amounts 67, dP and 6n from their equilibrium values. 

The minimum work will be done if the process occurs reversibly. The 
work done by an external source is then equal to the change in the energy of 


the system, i.e. 
ORmin = OE +6Eo; 


quantities without suffix refer to the small part considered, and those with 
suffix zero refer to the remainder of the system. We express dE in terms of 
the changes in the independent variables: 


bRmin = 6E +71 dSo—Po V0 + fo O70, 


where jo is the chemical potential of the solute in the medium; the number 
of solvent particles is unchanged in the process considered, and so the 
corresponding term for the solvent may be omitted. From the reversibility of 
the process it follows that 6So = —6S, and from the conservation of the 


t The differential of the energy of the medium (at constant N) is 
dEy = To dSy—Po dVo+po dito. 


Since the quantities 7), Pp, 49 may be regarded as constant, integration of this 
relation leads to a similar relation between the finite variations of the quantities 
Eos Sos Vos Ao- 

The quantity 44 should not be confused with the chemical potential of the pure 
solute. 
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total volume and quantity of solute in the whole solution we have 6V = 
—6Vo, 6n = —6np. Substituting these, we obtain the final expression for 
the work: 


8Rmin = SE—Ty 6S + Py OV —po 6n. (96.1) 


Thus the condition of equilibrium can be taken to be that for any small part 
of the solution the inequality 


d6E—Ty dS + Po 6V — pp 6n > 0 (96.2) 


holds. Henceforward, as in § 21, we shall omit the suffix zero in expressions. 
which are coefficients of the deviations of quantities from their equilibrium 
values; the values of these expressions in the equilibrium state will always be 
meant. 

We expand 6EF in powers of 6V, 6S and 6n (regarding E as a function of V,, 
S and n). As far as the second-order terms this gives 


a 2, =a (asy +— ee E (v4 oe = (on)? 


oe of OE OE 
+2 ~~ 6S 6bV +2 ~— 6S 6n+2—-— OV dn 


aS OV OS on ov on]. 


But 0E/OV = —P, dE/8S = T, dE/8n = pw’. Thus the first-order terms can- 
cel on substitution in (96.2), leaving 


ee 
ee 24 24 2 
2 6Rmin aa OE 6sy+ aa E ovye+ = (én) 
O2E OE OE 
+2 aoae OS OV +2 OS in n+2—0 2 ov bn > 0. (96.3) 


It is known from the theory of quadratic forms that, for a form in three 
variables (here 5S, 6V, 6) to be everywhere positive, its coefficients must 
satisfy three conditions, which for the form (96.3) are 


[Sz/OV2 BE/AVaS sE/aV 2" 

SEBS OV BES? G*E/BS On > 0, 

‘@E/On OV BE/OndS O2E/an® (96.4) 

|eE/OV?  sE/avas| 
0, S£/AS* > 0. 

‘@E/AS OV E/aS? ier 


20 
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Substituting the values of the derivatives of E with respect to V, S and n, we 

can write these conditions as 
| OP/OV OP/OS eaP/dn 
OT /eV oT/0S er /on | < 0, 
jouw /oV ou’/oS dp’/on, 
|OP/oV.  OP/AS | 
OT/aV AT /as | 


<0, eT/aS > 0. 


These determinants are Jacobians: 


GER-9 EB)-6 B),_-e 


The second and third conditions give the already known inequalities 
(OP/OV)r, , < 0 and C, > 0. The first condition may be transformed as 
follows: 


OP, T, pv’) m OCP, T, w’)/A(P, T, 7) 
oV,S,n) ov, S, n)/o(P, T, n) 
as (u'/On)p, <0 
(av, S)/O(P, T))n , 


Since the denominator is negative by the second condition (96.5), we must 
have 


(Op'/On)p, 7 > 0. (96.6) 
Using instead of n the concentration c = n/N, we find (since N is constant) 
(Ou’/Oc)p, r > 0. (96.7) 


Thus, as well as the inequalities (6P/OV), , < 0, C, > 0, the inequality 
(96.7) must also be satisfied in solutions. 

It may be noted that for weak solutions Oy’/dc = T/c, so that the in- 
equality (96.7) is always satisfied. 


The case where 
(Oy’/Oc)p, r = 0 (96.8) 


needs special consideration. 

The equality (96.8) corresponds to the vanishing of the first determinant 
in (96.4) (the third-order determinant). In this case the quadratic form 
(96.3) may vanish for certain values of 6S, 6V and 6n, and higher-order 
terms in its expansion would have to be examined in order to ascertain the 
conditions for the inequality (96.2) to be satisfied 

However, we shall see in § 97 that such a state is a critical point for the 
equilibrium of two liquid phases (two solutions with different concentra- 
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tions), which is analogous to the critical point of a liquid and a vapour. 
As in the latter case, the critical point in solutions is actually a singularity 
of the thermodynamic functions of the substance, and a regular expansion 
of these becomes impossible there. We shall merely mention that the regular 
expansion would lead (as in§ 152 for the critical point of a liquid and a va- 
pour) to the conditions 


(2y'/Oc%)p,r=0, (B8u'/Ac¥)p, r > 0, (96.9) 
which would have to be satisfied together with (96.8). 


§ 97. Equilibrium curves 


The state of a body consisting of identical particles is defined by the values 
of any two quantities, for instance P and T. To define the state of a system 
having two components (a binary mixture) it is necessary to specify three 
quantities, for instance P, T and the concentration. In this and subsequent 
sections, the concentration of the mixture will be defined as the ratio of the 
quantity of one of the substances to the total quantity of both, and will be 
denoted by x; clearly x takes values from 0 to 1. The state of a binary mixture 
may be represented by a point in a three-dimensional coordinate system, 
whose axes correspond to these three quantities (just as the state of a system 
of identical particles was represented by a point in the PT-plane). 

According to the phase rule, a two-component system can consist of not 
more than four phases in contact. The number of degrees of freedom of such 
a system is two when there are two phases, one for three phases, and none 
for four phases. The states in which two phases are in equilibrium are there- 
fore represented by points forming a surface in the three-dimensional 
coordinate system; states with three phases (triple points) by points forming 
a line (called the line of triple points or the three-phase line) and states with 
four phases by isolated points. 

It has already been shown in § 81 that, for systems with only one compo- 
nent, the states in which two phases are in equilibrium are represented by a 
curve in the PT-plane; each point on this curve determines the pressure and 
temperature (which are the same in both phases, from the conditions of 
equilibrium). Points not lying on the curve represent homogeneous states of 
the system. If the temperature and volume are taken as coordinates, the 
phase equilibrium is represented by a curve such that points within it repre- 
sent states where there is separation into two phases represented by the 
points of intersection of a straight line 7 = constant with the equilibrium 
curve. 

The situation is similar for mixtures. If we take as coordinates P, T and 
the chemical potential of one component (i.e. quantities which have equal 
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values for phases in contact), equilibrium of two phases is represented by a 
surface, each point of which determines P, T and u for the two phases in 
equilibrium. When three phases are present, the points representing their 
equilibrium (triple points) will lie on the curves of intersection of the equi- 
librium surfaces for each pair of them. 

The use of the variables P, 7, u is inconvenient, however, and in what 
follows we shall use P, 7, x as independent variables. In terms of these 
variables the equilibrium of two phases is represented by a surface whose 
points of intersection with a straight line P = constant, T = constant repre- 
sent the states of the two phases in contact for the relevant values of P and 
T (i.e. determine the concentrations of the phases, which may of course be 
different). The points on this line between the two points of intersection 
represent states in which a homogeneous body is unstable and therefore 
separates into two phases (represented by the points of intersection). 

We shall generally use two-dimensional diagrams with P and x, or T and 
x, aS coordinates; the lines of intersection of the equilibrium surface with 
the planes of constant temperature or pressure can then be drawn. We shall 
call these lines equilibrium curves. 

Let us consider the points on an equilibrium curve at which the concen- 
tration becomes equal in the two phases. Two cases are possible: (1) at such a 
point all other properties of the two phases also become equal, i.e. the phases 
become identical, (2) at such a point two distinct phases continue to exist. 
In case (1) the point is said to be a critical point, in case (2) it will be called a 
point of equal concentration. 

Near a critical point the equilibrium curve has the form shown in Fig. 20, 
or a similar form with a minimum at the critical point K (the abscissa being 
x and the ordinate P or 7’; the curve is then the intersection of the equilibrium 
surface with a plane of constant temperature or constant pressure respec- 
tively). Points lying within this curve (in the hatched region) represent states 
in which there is separation into two phases; the concentrations in these 
phases are determined by the points of intersection of the curve with the 
appropriate horizontal line. At the point K the two phases coalesce. A con- 
tinuous passage can be effected between any two points in the non-hatched 
region along any path that passes round the critical point. 


P,T P,T 

| K | | K | 
| | | A’ | 
| i | 
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Figure 20 shows that near the critical point there exist states in which two 
phases are in equilibrium which have concentrations x and x +6x differing 
by an arbitrarily small amount. For such phases the equilibrium condition 
is u(P, T, x) = w(P, T, x +6x), where u is the chemical potential of one of 
the substances in the mixture. Hence we see (cf. § 83) that at the critical point 
the condition 


(Ou /0x)p, r = 0 (97.1) 


must hold. 

| This condition is identical with (96.8), and hence the two definitions of 
the critical point (here and in § 96) are equivalent. It may be noted that u 
in (97.1) signifies the chemical potential of either of the two substances in the 
mixture; but the two conditions obtained by taking these two chemical 
potentials in (97.1) are actually equivalent. This is easily seen by noting that 
each of the chemical potentials is the derivative of ® with respect to the 
corresponding number of particles, and @ is a first-order homogeneous func- 
tion in both numbers of particles. 

The critical points clearly form a line on the equilibrium surface. 

Near a point of equal concentration the equilibrium curves must have 
the form shown in Fig. 21, or a similar form with a minimum at the point 
K. The two curves touch at the maximum (or minimum). The region between 
the two curves is that where separation into phases occurs. At the point K 
the concentrations of the two phases in equilibrium become equal, but the 
different phases continue to exist, since any path between the points which 
coincide at K must pass through the region of separation into two phases. 
Like critical points, points of equal concentration lie on a curve on the 
equilibrium surface. 

Let us now consider the properties of the equilibrium curves at low con- 
centrations (i.e. when one of the substances is present in the mixture in a 
considerably smaller quantity than the other; x is close to zero or to unity). 

-It has been shown in § 89 that at low concentrations (weak solutions) the 
difference between the phase equilibrium temperatures of solutions and of 
the pure substance (at a given pressure) is proportional to the difference of 
concentrations of the two phases. The same applies to the pressure difference 
at a given temperature. Moreover, it has been shown in § 90 (also for low 
concentrations) that the ratio of concentrations in the two phases depends 
only on P and T, and so it may be regarded as constant in the neighbourhood 
of x = 0. 

From the above it follows that at low concentrations the equilibrium 
curves have the form shown in Fig. 22, i.e. consist of two straight lines 
intersecting on the ordinate axis (or a similar form with the straight lines 
ascending). The region between the two lines is the region of separation into 
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phases. The regions above and below the lines are the regions of the two 
different phases. 

At the beginning of this section it has already been mentioned that a 
system with two components may consist of three phases in contact. Near a 
triple point the equilibrium curves appear as shown in Fig. 23. All three 
phases have equal pressure and temperature in equilibrium. The points 
A, B, C which determine their concentrations therefore lie on a straight 
line parallel to the axis of abscissae. The point A, which gives the concen- 
tration of the first phase at the triple point, is the point of intersection of the 
equilibrium curves 12 and 13 between the first and second, and first and 
third, phases. Similarly, the points B and C are the intersections of the equi- 
librium curves 12 and 23 between the first and second, and second and third, 


Fic. 22 Fic. 23 


phases (B), and of the equilibrium curves 23 and 13 between the second and 
third, and first and third, phases (C). The points A, B, C are of course, the 
points of intersection of the plane P = constant or T = constant with three 
lines on the equilibrium surface; we shall call the line corresponding to the 
point B a line of triple points or a three-phase line. The regions I, II, II rep- 
resent states of the separate phases, first, second and third. The region 
between the two curves 13 below the line ABC is the region of separation into 
the first and third phases, and those between the two curves 12 and the two 
curves 23 (above the line ABC) are respectively the regions of separation into 
the first and second, and second and third, phases. Region IT must obviously 
lie entirely above ABC (or entirely below ABC). At the points A, B and C the 
curves 12, 13 and 23 intersect, in general, at certain angles, and do not join 
smoothly. The directions of the curves 12, 13, 23 need not necessarily be as 
shown in Fig. 23, of course. The only essential feature is that the curves 12 
and 23 and the curves 13 must lie on opposite sides of the straight line ABC. 

If any of these singular lines on the equilibrium surface is projected on the 
PT-plane, the projection divides this plane into two parts. For a critical line, 
the points projected on one part are those corresponding to the two different 
phases and those corresponding to separation into these phases. The other 
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part of the PT-plane contains the projections of points which represent 
homogeneous states, at none of which does separation into two phases 
occur. In Fig. 24 the dotted line represents the projection of a critical line 
on the PT-plane. The letters a and b denote the two phases. The symbol a-b 
signifies that this part of the plane contains the projections of the two phases 
and those of states where these two phases are present in equilibrium. The 
symbol ab denotes the single phase into which the phases a and b merge above 
the critical points. 


rT 
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The projection of a three-phase line similarly divides the PT-plane into 
two parts. Figure 25 shows which points are projected on the two parts. 
The symbol a-b-c signifies that this region contains the projections of points 
which represent states of the phases a, b, c and states in which there is 
separation into phases a and b or b and c. 

Figure 26 shows a similar projection for a line of points of equal concentra- 
tion, and Fig. 27 for a line of phase equilibrium for a pure substance (i.e. 
x = 0 or x = 1); the latter, of course, lies in the PT-plane. The letter b in 
Fig. 27 signifies that this part of the plane contains the projections of points 
corresponding to states of phase b only. In the sequence of letters in the 
symbols a-b, a-b-c the letter b will be understood to denote a phase whose 
concentration is higher than that of a, and c a phase whose concentration is. 
higher than that of b.t 

It may be noted that the four types of singular point on the equilibrium 
curves (triple point, point of equal concentration, critical point, and pure- 


* To avoid misunderstanding, we should emphasise that the notation a-b-c for 
a line of equal concentration (unlikea three-phase line) is to some extent arbitrary: 
the letters a and c here denote states which are not two essentially different phases, 
since they never exist simultaneously in contact. 
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substance point) correspond to the four possible types of maximum (or 
minimum) on these curves. 

If any phase has the same fixed composition everywhere (i.e. independ- 
ently of the values of P and T), the equilibrium curves become somewhat 
simpler near the points here considered. Such phases are a chemical com- 
pound of the two components or pure-substance phases, which always 
have concentration x = 0 (or x = 1). 

Let us consider the form of the equilibrium curves when there are phases 
of constant composition, near points where the lines corresponding to these 
phases terminate. It is evident that such points must be maxima or minima 
of the equilibrium curves, and thus are among the types of point considered 
in this section. 
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If the phase of constant composition is a pure-substance phase with con- 
centration x = 0, the corresponding line coincides with the P-axis or the 
T-axis and can terminate at a point of the kind shown in Fig. 28. This dia- 
gram gives the form of the equilibrium curve near such a point; one of the 
lines in Fig. 22 coincides with the axis of ordinates. 

If one phase is a chemical compound of fixed composition, then near a 
point of equal concentration the equilibrium curve has the form shown in 
Fig. 29, i.e. the inner region in Fig. 21 becomes a vertical line. The hatched 
region on either side of this line is the region of separation into two phases, 
one a chemical compound whose composition is given by the vertical line. 
The curve has no break at the maximum, as in Fig. 21. 

Similarly, near a triple point the equilibrium curves have the form shown 
in Fig. 30. The phase which is a chemical compound is represented by a ver- 
tical line, to which region IT (Fig. 23) here reduces. 
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§ 98, Examples of phase diagrams 


In this section we shall enumerate the principal types of equilibrium curve; 
iff contrast to § 97, their form will be considered in general, and not only near 
the singular points. These curves (also called phase diagrams) can have many 
forms, but in most cases they belong to one of the types given below, or are 
a combination of more than one of these. The hatched regions in all these 
diagrams are the regions of separation into phases, and the remaining regions 
are those. of homogeneous states. The points of intersection of horizontal 
lines with the curves bounding the regions of separation into phases deter- 
‘mine the composition of the phases into which separation occurs (for given 
P and T). The relative amounts of the two phases are determined by the 
“lever rule” already mentioned in § 81. 


'' In what follows we shall for definiteness discuss Tx-diagrams; similar types 

are possible in the coordinates P nd x. The concentration x is taken as 
abscissa, and varies from 0 to 1. 

1. There are two phases; each can have any concentration (i.e. the two 
‘components mix in any proportion in both phases). In the simplest case, 
where the curves have no maxima or minima (apart from the pure-substance 
point), the phase diagram has the “cigar” form shown in Fig. 31. 

Let one of the phases be a liquid (the region below the cigar), and the 
other a vapour (the region above the cigar). Then the upper curve of the 
cigar is called the condensation point curve, and the lower curve the boiling 
point curve. 

If a liquid mixture of given composition is heated, then the liquid will 
begin to boil at a temperature determined by the intersection B of the vertical 
line AD (corresponding to the given concentration) with the lower curve of 
the cigar. Vapour boils off, whose composition is given by the point C, i.e. 


t The laws of boiling and condensation ‘of liquid mixtures were established by 
D. P. Konovalov (1884). 
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has a lower concentration than the liquid. The concentration of the remain- 
ing liquid is obviously increased, and its boiling point accordingly rises. 
On further heating, the point which represents the state of the liquid phase 
will move upwards along the lower curve, and the point which represents the 
vapour leaving the liquid will move upwards along the upper curve. Boiling 
ceases at various temperatures, depending on the way in which the process 
takes place. If boiling occurs in a closed vessel, so that all the vapour gener- 
ated remains permanently in contact with the liquid, the liquid will obvi- 
ously boil away completely at a temperature where the concentration of the 
vapour is equal to the original concentration of the liquid (the point D). 
In this case, therefore, boiling begins and ends at temperatures given by 
the intersection of the vertical line AD with the lower and upper curves of 


T 
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the cigar. If the vapour boiling off is steadily removed (boiling in an open 
vessel), then only the vapour just evolved will be in equilibrium with the liquid 
at any given time. In this case, it is evident that boiling will cease at the boiling 
point G of the pure substance, where the liquid and vapour compositions 
are the same. The condensation of vapour into liquid occurs in a similar 
manner. 

The situation is exactly analogous when the two phases are a liquid (above 
the cigar) and a solid (below the cigar). 

2. The two components mix in any proportion in both phases (as in case 
1), but there is a point of equal concentration. The phase diagram then has 
the form shown in Fig. 32 (or a similar form with a minimum). At the point 
of equal concentration, the two curves touch, and both have a maximum or 
a minimum. 

The transition from one phase to another occurs in the same way as 
described for case 1, except that the process can terminate (if one phase is 
steadily removed, as for example by boiling a liquid in an open vessel) not 
only at the pure-substance point but also at the point of equal concentration. 
At the composition corresponding to this point, the entire process occurs 
at a single temperature.’ 


+ A mixture corresponding to the point of equal concentration is also said to be 
azeotropic. 
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3. There are two phases, liquid and gas, in which the two components 
mix in any proportion, and there is a critical point. The phase diagram is as 
shown in Fig. 33 (K being the critical point). The region to the right of the 
curve corresponds to liquid states, and that to the left to gaseous states. 
It should be remembered, however, that when there is a critical point the 
liquid and gaseous phases can, strictly speaking, be distinguished only when 
they are in equilibrium with each other. 

A diagram of this type leads to the following curious effect. If a liquid 
whose composition is represented by the line AC (passing to the right of the 
point K) is heated in a closed vessel, then, after boiling begins (at the point 
B), the quantity of vapour will gradually increase as heating continues, but 
after a certain time it begins to decrease again, and the vapour disappears 
entirely at the point C. This is called retrograde condensation. 
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4. Two liquids which mix, but not in all proportions. The phase diagram 
is as shown in Fig. 34. At temperatures above that of the critical point K, the 
components mix in any proportion. Below this temperature the components 
do not mix in the proportions represented by points within the hatched 
region. In this region there is separation into two liquid mixtures whose 
concentrations are given by the points of intersection of the corresponding 
horizontal line with the equilibrium curve. Similar diagrams are possible 
with K a minimum, or with two critical points, an upper and a lower, so 
that the region of separation into two phases (two solutions) is bounded by 
a closed curve. 

5. In the liquid (or gaseous) state the two components mix in any propor- 
tion, but in the solid (or liquid) state they do not mixinall proportions (lim- 
ited miscibility). In this case there is a triple point. According as the tem- 
perature of the triple point lies below the pure-component phase equilib- 
rium temperatures (the points 4 and C) or between them (it obviously can- 
not lie above them on the assumption made here that the components mix 
in any proportion in the higher phase), the phase diagram appears as in 
Fig. 35 or Fig. 36 respectively. For example, let the phase of unlimited 
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miscibility be a liquid, and that of limited miscibility be a solid. The region 
above the curve ABC (Fig. 35) or ADC (Fig. 36) is the region of liquid states; 
the regions bounded by the curves ADF and CEG (Fig. 35) or ABF and CEG 
(Fig. 36) are the regions of homogeneous solid phases (solid solutions). At 
the triple point (whose temperature is given by the line DBE) the liquid and 
two solid solutions of different concentrations are in equilibrium. The point 
B in Fig. 35 is called the eutectic point. A liquid mixture whose concentration 


corresponds to this point freezes completely, without change of concentra- 
tion, whereas at other concentrations a solid mixture freezes out, with a 
concentration different from that of the liquid. The regions ADB and CBE 
(Fig. 35) and ADB and CDE (Fig. 36) correspond to separation into a liquid 
phase and one of the solid phases; the regions DEGF (Fig. 35) and BEGF 
(Fig. 36) correspond to separation into two solid phases. 

If, in a diagram such as Fig. 35, the components do not mix at all in the 
solid state, the phase diagram takes the form shown in Fig. 37. In the 


hatched regions above the line ABC, the mixed liquid phase is in equilibrium 
with the solid phase of one of the pure substances, and below ABC we have 
the two pure solid phases. When the temperature of the liquid mixture 
decreases, one or the other of the pure substances freezes out according as 
the concentration of the liquid lies to the right or the left of the eutectic 
point. As the temperature decreases further, the composition of the liquid 
varies along the curve DB or EB, and the liquid freezes completely at the 
eutectic point B. 

6. In the liquid state the two components mix in any proportion, but in the 
solid state they do not mix at all, forming only a chemical compound of 
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definite composition. The phase diagram is shown in Fig. 38. The straight 
line DE gives the composition of the chemical compound. There are two 
triple points, B and G, at which there is equilibrium between the liquid phase, 
the solid: chemical compound, and the solid phase of one of the pure com- 
ponents. Between the points B and G lies a point of equal concentration, D 
(cf. Fig. 29). It is easy to see where separation occurs, and which are the 
resulting phases: in the region DBE they are a liquid phase and the solid 
chemical compound; below the line CE, the chemical compound and one of 
the solid pure substances, and so on. The freezing of the liquid terminates 
at one of the eutectic points G and B, according as the concentration of the 
liquid lies to the right or to the left of the line DE. 


| 
! 
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7. In the liquid state the two components mix in any proportion, but in 
the solid state they do not mix at all, forming only a chemical compound; 
this compound, however, decomposes at a certain temperature, before it 
melts. The straight line defining the composition of this compound cannot 
terminate at a point of equal concentration as in case 6, since it does not 
reach the melting point. It can therefore terminate at a triple point of the 
type shown in Fig. 30, § 97 (the point A in Fig. 39). In Fig. 39, which shows 
one possible form of the phase diagram for this case, it is easy to see which 
phases result from the separation at various points in the hatched region. 

8. In the solid state the components do not mix at all, and in the liquid 
state they mix only in certain proportions. In this case there are two triple 
points, at which the liquid is in equilibrium with the two solid pure substan- 
ces (the point B in Fig. 40) or one of the pure substances is in equilibrium 
with two mixed liquid phases of different concentrations (the point D). The 
regions not hatched in Fig. 40, above ABC and above DE, represent liquid 
states with various concentrations; the hatched region above CD is that of 
separation into two liquid phases, DEF is that of separation into a liquid 
and one of the solid pure substances, and so on. 
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§ 99, Intersection of singular curves on the equilibrium surface 


The four kinds of line discussed in § 97 (critical lines, three-phase lines, 
lines of equal concentration and pure-substance lines) all lie on the same 
surface, the equilibrium surface. They will therefore in general intersect. 
Some properties of the points of intersection of such lines are described 
below. 

It may be shown that no two critical lines can intersect, nor can two lines 
of equal concentration. We shall not pause to prove these statements 
here. 

Let us now list (again without proof) the properties of the remaining points 
of intersection. All these properties follow almost immediately from the gen- 
eral properties of equilibrium curves given in § 97. The diagrams will show 
the projections of the intersecting lines on the PT-plane (see § 97); their form 
is, of course, chosen arbitrarily. A dotted line everywhere denotes a critical 
line; a continuous line, a line of phase equilibrium for a pure substance; 
a broken line, a line of equal concentration; and a dot-and-dash line, a three- 
phase line. The letters have the same significance as in Figs. 24 to 27 
(§ 97). 

At a point of intersection between a critical line and a pure-substance line 
(Fig. 41a) both lines terminate, and similarly for a critical line and a three- 
phase line (Fig. 41b). When a pure-substance line intersects a line of equal 
concentration, only the latter terminates (Fig. 41c), the two curves touching 
at the point of intersection. The same occurs when a line of equal concentra- 
tion meets a critical line (Fig. 41d) or a three-phase line (Fig. 41e). In each 
case the line of equal concentration terminates at the point of intersection, 
the two curves touching at this point. 
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The point of intersection of three-phase lines (Fig. 41f) is a quadruple 
point, i.e. a point where four phases are in equilibrium. Four three-phase 
lines meet at this point, corresponding to equilibrium between each three of 
the four phases. 
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Finally, the point where a pure-substance line intersects a three-phase line 
(Fig. 41g) must clearly be also a point of intersection between the three-phase 
line and all three pure-substance phase equilibrium lines (corresponding to 
equilibrium between each two of the three pure-substance phases). 


§ 100. Gases and liquids 


Let us now consider in more detail the equilibrium of liquid and gaseous 
phases consisting of two components. 

When the temperature is sufficiently high (T large in comparison with the 
mean interaction energy of the molecules) all substances mix in any propor- 
tion. On the other hand, since a substance is a gas at such temperatures, we 
can say that all substances have unlimited miscibility in the gas phase 
(although when there are critical lines the difference between the liquid and 
the gas becomes to some extent arbitrary, and so likewise does the foregoing 
formulation). 

In the liquid state, some substances mix in any proportion, others only in 
certain proportions (liquids of limited miscibility). 

In the former case, when the two components mix in any proportion in 
both phases, the phase diagrams contain no triple points, since the system 
cannot consist of more than two phases (all liquid states are one phase, and 
the same applies to gaseous states). Let us consider the projection of the 
singular lines of the equilibrium surface on the PT-plane. There are two lines 
of phase equilibrium for the pure substances (i.e. for concentrations x = 0 
and x = 1 in both phases). One of these lines is itself in the PT-plane, and 
the other is in a plane parallel to it, so that its projection has the same form 
as the line itself. Each of these lines terminates at a point which is a critical 
point for phases of the corresponding pure substance. A critical line begins 
and ends at these points (at a point of intersection of a critical line anda pure- 
substance line, both terminate; see § 99). Thus the projection of these various 
lines on the PT-plane has the form shown in Fig. 42; the notation is the same 
as in §§ 97 and 99. The letters g and / have a similar significance to a, b, ¢ 
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in the diagrams in §§ 97 and 99: g denotes gas, and / liquid. The regions g 
and / contain projections of gaseous and liquid states respectively; the region 
g-l includes these and also states where separation into liquid and gas 
occurs; above the critical line, the difference between liquid and gas does not 
exist. 

If there is also a line of equal concentration, the projection on the PT- 
plane is as shown in Fig. 43. The projection of the line of equal concentra- 
tion lies either above the line from the origin O to B (as in Fig. 43), or below 
OC, but not between them. Only A, B, C are points of intersection of lines. 
The point D does not correspond to a true intersection of the pure-substance 
line with the critical line; these intersect only in projection. The letters /; and 
J, in the diagram denote liquid phases of different concentrations. Above the 
line of equal concentration there is only one liquid phase.‘ 

All these properties of the projections of the singular lines on the PT-plane 
become obvious if we consider the phase diagrams corresponding to the 
cross-sections of the equilibrium surface by various planes of constant tem- 
perature (or pressure). For example, the cross-sections corresponding to 
pressures below that at B and to pressures between those of 4 and B in 
Fig. 42 give phase diagrams as shown in Figs. 31 and 33 respectively. Figure 
44 shows cross-sections for various successive temperature ranges in Fig. 43 
(T,, T3, Tc being the temperatures corresponding to the points A, B, C): 
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+t Not being concerned with solid phases, we shall conventionally show lines 


in all the (P, T) diagrams as starting from the origin, as if solidification did not 
occur. on 
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the region of separation into two phases “breaks up” at the point of equal 
concentration, and two critical points are formed; thereupon, first one and 
then the other hatched region shrink to a point on the ordinate axis and dis- 
appear. Figure 45 shows similar cross-sections for successive pressure ranges. 

If the two components have limited miscibility in the liquid state, there is a 
three-phase line, which terminates at a point where it intersects a critical line 
starting from that point. Figures 46 and 47 show the two essentially different 
types of (P, 7) projection that can occur in this case. They differ in that in 
Fig. 46 the projection of the three-phase line lies above both the pure-sub- 
stance lines, while in Fig. 47 it lies between them; the three-phase line cannot 
lie below both the pure-substance lines, since in the gaseous state the two 
components mix in any proportion. In both cases there are two critical lines, 
one of which runs out towards high pressures. 


2! 


304 Solutions 


Figures 48 and 49 show a number of successive cross-sections by Px and 
Tx planes for the case shown in Fig. 46. 

In conclusion, it should be emphasised that the examples of (P, 7) dia- 
grams discussed in this section are only the most typical ones for equilibrium 
of liquid and gaseous phases; they do not exhaust all theoretically possible 
forms. 


CHAPTER X 


CHEMICAL REACTIONS 


§ 101. The condition for chemical equilibrium 


A CHEMICAL reaction occurring in a mixture of reacting substances ultimately 
leads to the establishment of an equilibrium state in which the quantity of 
each of the substances that take part in the reaction no longer changes. This 
case of thermodynamic equilibrium is called chemical equilibrium. Any chem- 
ical reaction can take place, in general, in either direction; until equilibrium 
is reached, one direction predominates, but in equilibrium the two opposite 
reactions occur at rates such that the total numbers of particles of each of 
the reacting substances remain constant. The object of thermodynamics as 
applied to chemical reactions is to study only the chemical equilibrium, not 
the course of the reaction leading to that equilibrium. 

It is important to note that the state of chemical equilibrium is independ- 
ent of how and under what conditions the reaction occurred;' it depends 
only on the conditions under which the mixture of reacting substances exists 
in equilibrium. In deriving the condition for chemical equilibrium, we can 
therefore make any desired assumptions concerning the course of the 
reaction. 

First of all, we shall describe the method to be used for expressing the 
reaction. Chemical reactions are commonly written as symbolic equations, 
which, if all the terms are taken to one side, have the form 

> v;Aj; = 0, (101.1) 
i 
where the A, are the chemical symbols of the reacting substances, and the 
coefficients v, are positive or negative integers. For example, in the reaction 
2H2+O2 = 2H20 or 2H2+O2—2H20 = 0 the coefficients are ry, = 2, 
Y¥%, = 1, 'y,90 = —2. 

Let us assume that the reaction occurs at constant temperature and pres- 
sure. In such processes the thermodynamic potential of the system tends to 
a minimum. In equilibrium, therefore, the potential ® must have its least 


t In particular, it is independent of whether a catalyst took part in the reaction. 
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possible value (for given P and 7). Let Ni, No, ... be the numbers of par- 
ticles of the various substances taking part in the reaction. Then the necessary 
condition for ® to be a minimum can be written as the vanishing of the total 
derivative of © (for given P and T) with respect to one of the N,, say N,: 


o® pce GONG o® dN35 


ON, + @Na dNv + ON, dN, + 


The changes in the numbers N, during the reaction are related by the reaction 
equation: it is clear that, if N, changes by »,, each of the other N, will change 
by »,, so that dN, = (»/»,) dN,, dN,/dN, = »,/r,. The foregoing equation 
may therefore be written 
re ua 
» ON, ON; i V1 = 


Finally, putting 86/6N, = u, and multiplying by »,, we have 
2 Vip; = 0. (101.2) 


This is the required condition for chemical equilibrium. In order to obtain 
it, therefore, we must replace the symbols A, by the corresponding chemical 
potentials yu, in the equation of the chemical reaction. When several different 
reactions can occur in the mixture, the equilibrium condition will be a set of 
several equations such as (101.2), each obtained by the above method from 
the equation of the corresponding reaction. 

It may be noted that the condition (101.2) retains its form even when the 
reacting substances are distributed in the form of solutes in two different 
phases in contact. This follows from the fact that in equilibrium the chemical 
potentials of each substance in either phase must be equal, in accordance 
with the conditions for phase equilibrium. 


§ 102. The law of mass action ; 


Let us apply the general condition for chemical equilibrium, derived in 
§ 101, to reactions taking place in a gas mixture, assuming that the gas may be 
regarded as an ideal one. 

The chemical potential of each gas in the mixture is (see § 93) 


My = Tlog P.+4T); (102.1) 


where P, is the partial pressure of the ith gas in the mixture; P, = c,Pif P 
is the total pressure of the mixture and c, = N,/N is the concentration of the 
gas in question, defined as the ratio of the number N, of molecules of that gas 
to the total number N = N, of molecules in the mixture. 
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- It is now easy to write down the condition of chemical equilibrium for 
reactions in a mixture of gases. Substitution of (102.1) in (101.2).gives 


py Wf; = Ty », log Put) vi; = 0, 


where the Py, are the partial pressures of the gases in a state of chemical 


equilibrium. 
Using the notation 
KT) = e— el, (102.2) 
we thus have 
| [] Pet = K,(T). (102.3) 


Instead of Py, we can substitute Pco,, where the cy, are the concentrations of 
the gases in chemical equilibrium. Then 


I] ct, = P~"K,(T) = K.P, T). (102.4) 


The quantity on the right of (102.3) or (102.4) is a function only of tempera- 
ture and pressure, and does not depend on the initial amounts of the 
reacting gases: this quantity is called the chemical equilibrium constant, and 
the law expressed by formula (102.3) or (102.4) is called the Jaw of mass 
action. 

The dependence of the gas reaction equilibrium constant on the pressure 
is entirely determined by the factor P~*” on the right-hand side of equation 
(102.4); if the quantities of reacting substances are expressed in terms of their 
partial pressures, the equilibrium constant is independent of pressure. The 
determination of its dependence on temperature, however, requires further 
assumptions concerning the properties of the gases. 

For example, if the gases have constant specific heats, a comparison of the 
expression (102.1) with formula (43.3) for the thermodynamic potential of 
such a gas shows that the functions ¥,(7') are of the form 


X(T) = &:—C,T log T—-TE,, (102.5) 


where c,, is the specific heat and ¢, the chemical constant of the gas. Substi- 
tuting this expression in (102.2), we obtain for the equilibrium constant the 


formula 
KT) = eT ep%e—2e0/T , (102.6) 


which is essentially an exponential function of temperature. 

The law of mass action is valid also for reactions between solutes, provided 
that the solution may be regarded as weak. For the chemical potential of each 
solute has the form 

3 = T log ci+yi(P, T). (102.7) 
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The concentration c, is here defined as the ratio of the number of particles 
of the solute in question to the number of solvent particles (c, = n,/N). 
Substituting (102.7) in the equilibrium condition (101.2), we find in the same 
way 
[] cs = K(, T), (102.8) 
i 


with the equilibrium constant 
K(P, T) = e-2"T (102.9) 


Unlike the case of gas reactions, the dependence of the equilibrium constant 
on the pressure here remains indeterminate. 

If the reaction involves, as well as gases or solutes, substances in a pure 
condensed phase (i.e. not mixed with other substances), e.g. pure solids, then 
the equilibrium condition again leads to the law of mass action. Here, how- 
ever, since the chemical potential of the pure phases depends only on the 
pressure and temperature, the left-hand side of the equation for the law of 
mass action will not involve the quantities of the pure phases, i.e. the product 
of the concentrations of the gases (or solutes) must be written as if the solids 
were absent. The latter affect only the dependence of the equilibrium constant 
on pressure and temperature. 

If only gases and solids take part in the reaction, then, since the pressure 
of the gases is comparatively small, the chemical potential of the solids may 
be regarded as independent of the pressure, and the dependence of the equi- 
librium constant on the pressure remains the same as in (102.4). The sum 
2», in the exponent must of course denote only the sum of the coefficients of 
the gaseous substances in the reaction equation. 

Finally, the law of mass action is valid also for reactions in weak solutions 
where the solvent as well as the solutes takes part in the reactions. For, when 
its chemical potential is substituted in the condition for chemical equilib- 
rium, the small terms which contain the concentration may be omitted, and 
the potential then reduces to a quantity which depends only on temperature 
and pressure. Thus we again obtain the equation of the law of mass action, 
and its left-hand side again involves only the concentrations of the reacting 
solutes, not that of the solvent. 


PROBLEMS 


PROBLEM 1. Find the degree of dissociation of a diatomic gas at high tempera- 
tures, if the gas molecule consists of identical atoms and has no spin or orbital 
angular momentum in the ground state. 


SoLuTIoNn. The reaction has the form A, = 2A. In these Problems, we shall use 
the suffixes 1 and 2 to denote quantities relating respectively to the atomic (A) 
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and molecular (A,) components of the mixture. The degree of dissociation is defined 
as the ratio = N,/2Np of the number 4N, of dissociated molecules to the total 
number of molecules (that would exist in a non-dissociating gas) No = No +3N,. 
In accordance with the law of mass action (102.3), we have 


Py , NAN, +No) = 1—a? 


P27 PN2 4q2P 


whence 
a = [1+4PK,(T)]~. 


The equilibrium constant Kp is found by substituting in (102.6) the specific heats 
Cp = s, Cpe = 4 and the chemical constants 


C, = log [e,(m/2n?8?], Cf, = log [(1/#w) (m/m)?/7] 


(see (45.4), (46.4), (49.8)), where m is the mass of the atom A, and g, the statis- 
tical weight of the ground state of the atom A; at sufficiently high temperatures 
8, = 22S+1) @L+1), where S and L are the spin and orbital angular momen- 
tum of the atom. 

The result is 


8175/2 


ao /T 


where € = 2&9; —£o, is the dissociation energy of the molecule. 
PROBLEM 2. The same as Problem 1, but to find the specific heat. 


SOLUTION. The entropy of the gas may be calculated as the sum 


S=M, (x a: Eo1 — +N, (cn 4 =o 


— m(¢ ‘pl +2) +™(ent Fr) (Ny +22); 


the entropy of each component is expressed in terms of its chemical potential 
according to (43.6) and (43.3), and the equilibrium equation uw, = 2p, is used. 
Expressing N, and N, in terms of No and «, writing the chemical potential as 


Hy = €,+T log Py—c¢y,TlogT—C,T, Py = 2aP/(1+<), 


and substituting the values of c,; and cy», we obtain 


1— 1 
S= m| - = Eg t5 log T+ %—2 log Fig teonstant, (3) 


where the dissociation energy &9 is again used, and the constant represents the 
temperature-independent terms, which do not influence the required specific 


t See the footnote to § 104. 
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heat C, = T(0S/0T)p. From (1) we calculate the derivative 


ox\ = i d log Kp 
(sr), “74 ve ar 


_ A-aPa (3 7) 


OE ee oD 
with Kp from (2). Now differentiating the entropy (3), we have finally 


1 &  1\? 
Cp = 3 No [ota-tatt—ah (F + 2) | . 


PROBLEM 3. Determine the dependence of the concentration of hydrogen dis- 
solved as H atoms in a metal on the pressure of H, gas over the metal. 


SOLUTION. Regarding the process as a chemical reaction H, = 2H, we can 
write the equilibrium condition as uy, = 24H; 4H, is written as the chemical poten- 
tial of an ideal gas, uy, = T log P+¥4(7), and wy as that of the solute in a solution, 
fy = T log c+. Since p depends only slightly on the pressure (cf.§ 90), we find 
c = constant X 4/P. 


§ 103. Heat of reaction 


A chemical reaction is accompanied by the absorption or evolution of heat. 
In the former case the reaction is said to be endothermic, and in the latter 
case exothermic. It is evident that, if any particular reaction is exothermic, 
the reverse reaction will be endothermic, and vice versa. 

The amount of heat involved in a reaction depends on the conditions under 
which the reaction occurs. Hence, for instance, we must distinguish the heats 
of reaction at constant volume and at constant pressure (although the differ- 
ence is usually quite small). 

As in calculating the heat of solution(§ 91), we first find the maximum work 
which can be obtained by means of the chemical reaction. We call a reaction 
between one group of molecules as shown by the reaction equation an “ele- 
mentary reaction”, and calculate the change in the thermodynamic potential 
of a mixture of reacting substances when a small number 6n of elementary 
reactions take place, assuming that the reaction occurs at constant tempera- 
ture and pressure. We have 


o@ 
60 = Y¥' —— dN, = , ON;,. 
Day, ONE = 2m ON 


The change in the number of molecules of the ith substance after 6n elemen- 
tary reactions is clearly 6N, = — », én. Thus 


8D = —8n Ym. (103.1) 


In equilibrium 6®/6n is zero, as we should expect. 
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Formula (103.1) is the general expression for the minimum work which 
must be done in order to bring about 6n elementary reactions. It is also the 
maximum work which can be obtained from that number of reactions occur- 
ring in the reverse direction. 

Let us first suppose that the reaction is between gases. Using the expression 
(102.1) for u,, we find 


bb = ~on(TS log P+, rau)» 
or, in terms of the equilibrium constant, 


56 = T dn! — log P+ log K,(T)| 
L Z| 
= T onl —y, »;log e+ log KP, T) I. (103.2) 
L a 


For reactions in solution we similarly find, using the expressions (102.7) 

and (102.9), 
6 = T dnl —Y vlog c;-+log K(P, T)|. (103.3) 
me, oF -l 

The sign of 6® determines the direction in which the reaction takes place: 
since ® tends to a minimum, for 6® < 0 the reaction occurs in the forward 
direction (i.e. “from left to right” in the equation of the chemical reaction), 
while if 5© = 0 the reaction will actually go in the opposite direction in the 
mixture concerned. It may be noted, however, that the direction of the 
reaction is also evident directly from the law of mass action: we form the 
product JZP” for the mixture in question and compare it with the value of 
the equilibrium constant for the reaction. If, for instance, we find that ZP” > 
K,, this means that the reaction will occur in the forward direction, so as 
to reduce the partial pressures of the original substances (which have positive 
y, in the reaction equation), and increase those of the reaction products 
(for which », < 0). 

We can now determine also the heat absorbed (or evolved, according to 
sign), again for 6m elementary reactions. Formula (91.4) shows that this heat 
dQ, is, for a reaction at constant temperature and pressure, 


For reactions between gases we have, substituting (103.2), 


8 log K,(T) 


60, = —T? no 


(103.4) 
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Similarly, for solutions 
6Q, = —T? bn Glog K(P, T) : 


= (103.5) 


We may note that 6Q, is simply proportional to én and does not depend on 
the values of the concentrations at any instant. These formulae are therefore 
valid for any én, whether small or not. 

If Q, > 0, i.e. the reaction is endothermic, 0 log K/OT < 0, and the equi- 
librium constant decreases with increasing temperature. On the other hand, 
for an exothermic reaction (Q, < 0) the equilibrium constant increases with 
temperature. An increase in the equilibrium constant signifies that the chem- 
ical equilibrium is shifted back towards the re-formation of the initial sub- 
stances—the reaction goes “from right to left”, so as to increase the product 
ITcos. Conversely, a decrease in the equilibrium constant signifies a shift 
of the equilibrium towards formation of the reaction products. In other 
words, we can formulate the following rule: heating shifts the equilibrium in 
the direction of the endothermic process, and cooling in the direction of the 
exothermic process. This rule is entirely in agreement with Le Chatelier’s 
principle. 

For reactions between gases the heat of reaction at constant volume (and 
temperature) is also of interest. This quantity 5Q, is related in a simple 
manner to 6Q,. The quantity of heat absorbed in a process at constant vol- 
ume is equal to the change in the energy of the system, whereas 6Q, is equal 
to the change in the heat function. Since E = W— PY, it is clear that 60,, = 
= 60,—6(PV), or, substituting PV = TXN, and 6N, = — », 6n, 


60, = 80+T dn), ». (103.6) 


Finally, let us determine the change in volume of a mixture of reacting 
substances as a result of a reaction occurring at constant pressure (and tem- 
perature). For gases, the problem is trivial: 


8V = (T/P) 6N = —(T/P) én». (103.7) 


In particular, reactions in which the total number of particles is unchanged 
(Lv, = 0) occur without change of volume. 

For reactions in weak solutions we use the formula 6bV = 0 60/OP and, 
substituting (103.3), we obtain 
8 log K(P, T) 


OV = Ton ap 


(103.8) 


Thus a change in volume in the reaction is due to a pressure dependence of 
the equilibrium constant. In a similar way to the previous discussion of the 
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temperature dependence, we easily deduce that an increase in pressure fa- 
vours reactions in which the volume decreases (i.e. shifts the equilibrium in 
the direction of such reactions), and a decrease in pressure favours reactions 
which lead to an increase in volume, again in complete agreement with 
Le Chatelier’s principle. 


§ 104. Ionisation equilibrium 


At sufficiently high temperatures, collisions betwen gas particles may 
cause their ionisation. The existence of such thermal ionisation leads to the 
establishment of an equilibrium in which certain fractions of the total num- 
ber of gas particles are in various stages of ionisation. Let us consider ther- 
mal ionisation of a monatomic gas; this is the most interesting case, since 
chemical compounds are usually completely dissociated before the onset 
of thermal ionisation. 

Thermodynamically, ionisation equilibrium is a particular case of chem- 
ical equilibrium corresponding to a series of simultaneously occurring 
“ionisation reactions”, which may be written 


Ao= Ai+te", Az= Aegt+e,..., (104.1) 


where the symbol Ao denotes the neutral atom, Ai, Ag, ... the singly, doubly, 
etc., jlonised atoms and e~ the electron. For these reactions the application 
of the law of mass action gives the set of equations 


Cy_zfene = PKOYT) = (n= 1,2, ...), (104.2) 


where co is the concentration of neutral atoms, c1, C2, ... the concentrations 
of the various ions, and c the concentration of electrons (each defined as the 
ratio of the number of particles of the kind in question to the total number 
of particles, including electrons). To these equations we must add one which 
expresses the electrical neutrality of the gas as a whole: 


C= ¢44+2ce+3c34+ ... . (104.3) 


Equations (104.2) and (104.3) determine the concentrations of the various 
ions in ionisation equilibrium. 

The equilibrium constants K{” can be calculated without difficulty. All 
gases which take part in “reactions” (gases of neutral atoms, ions, or elec- 
trons) are monatomic and have constant specific heat c, = 3, and their 
chemical constants are € = log [g(m/27h*)*"], where m is the mass of a 
particle of the gas considered, and g the statistical weight of its ground state; 
for electrons, g = 2, while for atoms and ions g = (2L+1)(2S+1), where 
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L and S are the orbital angular momentum and spin of the atom or ion.’ 
Substituting these values in formula (102.6), we obtain the following expres- 
sion for the required equilibrium constants: 


oe 22 \38 Hh 
KOT) = Sale ron en (104.4) 
(M. N. Saha, 1921) where m is the electron mass and J, = &,—€&p,,1 the 
energy of the mth ionisation (nth ionisation potential) of the atom. 

The degree of n-fold ionisation of the gas becomes of the order of unity 
as the temperature increases and the equilibrium constant Ki = PK™ 
decreases to a value of the order of unity. It is very important to note that, 
despite the exponential dependence of the equilibrium constant on tempera- 
ture, this stage is reached not when T ~ [, but at considerably lower tem- 
peratures. The reason is that the coefficient of the exponential e”” is small: 
the quantity (P/T)(#"/mT)** = (N/V) (47/mT)*” is in general very small, 
being for T ~ J of the order of the ratio of the atomic yorum to the vo- 
lume V/N per atom in the gas. 

Thus the gas will be considerably ionised even at temperatures which are 
small compared with the ionisation energy, but the number of excited atoms 
in the gas will still be small, since the excitation energy of the atom is in 
general of the same order as the ionisation energy. 

When T becomes comparable with the ionisation energy, the gas is almost 
completely ionised. At temperatures of the order of the binding energy of 
the last electron in the atom, the gas may be regarded as consisting of elec- 
trons and bare nuclei only. 

The binding energy J; of the first electron is usually much less than the 
subsequent ones J,; there is therefore a range of temperatures in which the 
gas may be supposed to include only neutral atoms and singly charged ions. 
Defining the degree of ionisation « of the gas as the ratio of the number of 
ionised atoms to the total number of atoms, we have 


c=¢c,=a/(1l+«), co = (1—«)/ +2), 
and equation (104.2) gives (1—a”)/a? = PK™, whence 
a = Up +PKO). (104.5) 
This entirely determines the degree of ionisation as a function of pressure 


and temperature (in the temperature range considered). 


t For reasons given below we may assume that all atoms and ions are in the 
ground state, even in a considerably ionised gas. If the atom (or ion) ground state 
has a fine structure, we assume that 7 is large compared with the intervals in this 
structure. ’ 


x 
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§ 105. Equilibrium with respect to pair production 


At extremely high temperatures, comparable with the rest energy' mc? of 
the electron, collisions of particles in matter may be accompanied by the 
formation of electron-positron pairs. The number of particles itself then 
ceases to be a given quantity, and depends on the conditions of thermal 
equilibrium. 

Pair production (and the reverse process, annihilation) can be regarded 
thermodynamically as a “chemical reaction” e+ -+-e~ = y, where the sym- 
bols e+ and e~ denote a positron and an electron, and y denotes one or 
more photons. The chemical potential of the photon gas is zero (§ 63). 
The condition of equilibrium for pair production is therefore 


+ pt = 0, (105.1) 


where u~ and put are the chemical potentials of the electron and positron 
gases. It should. be emphasised that u here denotes the relativistic expression 
for the chemical potential, including the rest energy of the particles (cf. 
§ 27), which plays an important part in pair production. 

Even at temperatures T ~ mc*, the number of pairs formed per unit 
volume is very large in comparison with the atomic electron density (see 
the next footnote). We can therefore suppose with sufficient accuracy that 
the number of electrons is equal to the number of positrons. Then w~ = u*, 

- and the condition (105.1) gives u~ = wt = 0, i.e. in equilibrium the chemi- 
cal potentials of the electrons and positrons must be zero. 

Electrons and positrons obey Fermi statistics; their number is therefore 
obtained by integrating the distribution (56.3) with u = 0: 


= = Sd 


where é is determined from the relativistic expression ¢ = c ~/(p? +m?c?). 
For T « mc*, this number is exponentially small (oc e~"/"). In the 
opposite case (T >> mc?) we can put € = cp, and formula (105.2) gives 


V (T\380 x8dx 
+ — OS a Raton, PaaS 
aera = (ie) | err 


+ The energy mc? = 0.51108 eV, so that the temperature mc?/k = 6X 10° 
degrees. 
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The integral in this formula can be expressed in terms of the ¢ function (see 
the second footnote to § 58), giving’ 


= aie SOL ee, 
Nt =N- =) (Fe) V = 0.183(T ic V. (105.3) 


The energy of the positron and electron gases is similarly 


= TeV T*/120(fic®. (105.4) 


_ VT (T\8f dx 
(re) 


+— F- — Pieper S 
= - m0? e* +1 
This quantity is £ of the energy of black-body radiation in the same volume. 


PROBLEM 
Determine the equilibrium density of electrons and positrons for T < mc. 


SOLUTION. Using the expression (46.1a) for the chemical potential (to which 
mc? must be added), we obtain 


ntn~ = 4(mT/2nh8 e—*meX!T 


where n™ = N/V and nt = N+/V are the electron and positron densities. If no 
is the initial electron density (in the absence of pair production), then n= = n*+-np, 
and we find 


nt = nom = —gngt [ang ta(me/A)® (T/scmet) emer U2, 


t ForT ~ mc? the volume per pair formed is ~ (4i/mc)’. This volume is very small 
in comparison with the atomic volume (the cube of the Bohr radius, (4?/me?)*). 


CHAPTER XI 


PROPERTIES OF MATTER AT VERY 
HIGH DENSITY 


§ 106. The equation of state of matter at high density 


THE study of the properties of matter at extremely high density is of funda- 
mental importance. Let us follow qualitatively the change in these properties 
as the density is gradually increased. 

When the volume per atom becomes less than the usual size of the atom, 
the atoms lose their individuality, and so the substance is transformed into 
a highly compressed plasma of electrons and nuclei. If the temperature of 
the substance is not too high, the electron component of this plasma is a 
degenerate Fermi gas. An unusual property of such a gas has been mentioned 
at the end of § 57: it becomes more nearly ideal as the density increases. 
Thus, when the substance is sufficiently compressed, the interaction of the 
electrons with the nuclei (and with one another) becomes unimportant, and 
the formulae for an ideal Fermi gas may be used. According to (57.9) this 
occurs when n, > (m,e*/h*)> Z* holds, where n, is the number density of 
electrons, m, the electron mass, and Z some mean atomic number of the 
substance. We therefore find for the total mass density of the substance the 
inequality 


0 > (m.e"/h? m’Z? ~ 20Z? g/cm3, (106.1) 


where m’ is the mass per electron, so that 9 = n,m’. The “gas of nuclei” 
may still be far from degeneracy, because of the large mass of the nucleus, 
but its contribution to the pressure of the substance, for example, is in any 
case entirely negligible in comparison with that of the electron gas. 

Thus the thermodynamic quantities for a substance under the conditions 


t In all the numerical estimates given in this section it is assumed that the mean 
atomic weight of the substance is twice its mean atomic number, so that m’ is 
twice the nucleon mass. 

It may be mentioned that the degeneracy temperature of the electrons corre- 
sponding to a density 9 ~ 20Z? g/cm? is of the order of 10®Z4? degrees. 
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in question are given by the formulae derived in § 57, applied to the electron 
component. In particular, for the pressure we have' 


2\2/3 2 5/3 
P= Cry = (x) | (106.2) 


The condition (106.1) on the density gives for the pressure the numerical 
inequality P > 5x 10° Z!* bar. 

In the above formulae the electron gas is assumed non-relativistic. This 
implies that the Fermi limiting momentum p, is small compared with mc 
(see § 61), giving the numerical inequalities 


0<«<2X108 g/cm’, P< 10" bar. 


When the density and pressure of the gas become comparable with these 
values, the electron gas becomes relativistic, and when the opposite inequal- 
ities hold we have the extreme relativistic case, where the equation of state is 


determined by formula (61.4): 
= 73%)" fic(o/m’)*. (106.3) 


A further increase in density leads to states where nuclear reactions con- 
sisting in the capture of electrons by nuclei (with emission of neutrinos) are 
thermodynamically favoured. Such a reaction decreases the charge on the 
nucleus (leaving its atomic weight constant), and this in general causes a 
decrease in the binding energy of the nucleus, i.e. a decrease in its mass 
defect. The energy required to bring about such a process is more than 
counterbalanced at sufficiently high densities by the decrease in the energy of 
the degenerate electron gas because of the smaller number of electrons. 

It is not difficult to write down the thermodynamic conditions which 
govern the “chemical equilibrium” of the nuclear reaction mentioned, which 
may be symbolically written as 


Azte” = Az-it+?, 


where Az denotes a nucleus of atomic weight A and charge Z, e~ an elec- 
tron and » a neutrino. The neutrinos are not retained by matter and leave 


t Numerically, this formula gives 
P= 1.0x< 1018 (0/A’)8 dyn/cm? = 1.0% 107 (9/A’)*8 bar, (106.2a) 
where A’ = m/m,, is the atomic weight of the substance per electron (m, being the 


nucleon mass); 9 is measured in g/cm?. The corrections to (106.2) because of the 
Coulomb interaction of the particles have been discussed in § 80. 


f With the notation of (106.2a), 
P= 1.2 10° (0/ A’) bar. (106.3a) 
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the body; such a process must lead to a steady cooling of the body. Thus 
thermal equilibrium can be meaningfully considered in these conditions 
only if the temperature of the substance is taken as zero. The chemical poten- 
tial of the neutrinos will not then appear in the equation of equilibrium. The 
chemical potential of the nuclei is mainly governed by their internal energy, 
which we denote by —e, z (the term “binding energy” usually refers to the 
positive quantity e, 7). Finally, let u,(n,) denote the chemical potential of the 
electron gas as a function of the number density n, of particles in it. Then 
the condition of chemical equilibrium takes the form —e, 7+y,(n,) = 
—&4 7-1 OF, putting e4 7—&4,7-1 = 4, 


He(ne) = A. 


Using formula (61.2) for the chemical potential of an extreme relativistic 
degenerate gas, we thus find 
Ne = A3/3n7%(chy . (106.4) 
The equilibrium condition therefore gives a constant value of the electron 
density. This means that, as the density of the substance gradually increases, 
the nuclear reaction in question begins to occur when the electron density 
reaches the value (106.4). As the substance is further compressed, more and 
more nuclei will each capture an electron, so that the total number of elec- 
trons will decrease but their density will remain constant. Together with the 
electron density the pressure of the substance will also remain constant, 
being again determined mainly by the pressure of the electron gas: substitu- 
tion of (106.4) in (106.3) gives 
P = A*/127 (hic). (106.5) 


This will continue until each nucleus has captured an electron. 

At still higher densities and pressures the nuclei will capture further 
electrons, the nuclear charge being thus reduced further. Ultimately the nu- 
clei will contain so many neutrons that they become unstable and break up. 
At a density 9 ~ 3X10" g/cm? (and pressure P ~ 10% bar) the neutrons 
begin to be more numerous than the electrons, and when g ~ 10 g/cm® 
the pressure due to the neutrons begins to predominate (F. Hund, 1936). 
This is the beginning of a density region in which matter may be regarded 
as essentially a degenerate neutron Fermi gas with a small number of elec- 
trons and various nuclei, whose concentrations are given by the equilibrium 
conditions for the corresponding nuclear reactions. The equation of state 
of matter in this range is 

(327)? h2 
~ 5 mgs 


P p83 = 5,5 108 5 bar, (106.6) 


where m,, is the neutron mass. 
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Finally, at densities 9 >> 610% g/cm3, the degenerate neutron gas be- 
comes extreme-relativistic, and the equation of state is 


2\1/3 4/3 
p= OP te( 2.) = 1.2% 10° of bar. (106.7) 


It should be remembered, however, that at densities of the order of that of 
nuclear matter the specifically nuclear forces (strong interaction of nucleons) 
become important. In this range of densities formula (106.7) can be only 
qualitative. In the present state of our knowledge concerning strong inter- 
actions we can draw no definite conclusions concerning the state of matter 
at densities considerably above the nuclear value. We shall merely mention 
that in this range other particles besides neutrons may be expected to appear. 
Since particles of each kind occupy a separate group of states, the conversion 
of neutrons into other particles may be thermodynamically favoured because 
of the decrease in the limiting energy of the Fermi distribution of neutrons. 


§ 107. Equilibrium of bodies of large mass 


Let us consider a body of very large mass, the parts of which are held 
together by gravitational attraction. Actual bodies of large mass that are 
known to us, namely the stars, continuously radiate energy and are certainly 
not in thermal equilibrium. It is, however, of fundamental interest to discuss 
an equilibrium body of large mass. We shall neglect the effect of temperature 
on the equation of state, i.e. consider a body at absolute zero—a “cold” 
body. Since in actual conditions the temperature of the outer surface is 
considerably lower than the internal temperature, a discussion of a body 
with a constant non-zero temperature is in any case devoid of physical 
meaning. 

We shall further assume that the body is not rotating; then it will be 
spherical in equilibrium, and the density distribution will be symmetrical 
about the centre. 

The equilibrium distribution of density (and of the other thermodynamic 
quantities) in the body will be determined by the following equations. The 
Newtonian gravitational potential ¢ satisfies the differential equation Ad = 
4xGo, where og is the density of the substance and G the Newtonian con- 
stant of gravitation. In the case of spherical symmetry, 


1 d/,dd\ _ 
— ( a) = AnGo. (107.1) 


Moreover, in thermal equilibrium the condition (25.2) must be satisfied. In 
the gravitational field the potential energy of a particle of the body of mass 
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m’ is m’¢, and so we have 


+m’ = constant, (107.2) 


where for brevity the suffix zero is omitted from the chemical potential of 
the substance in the absence of the field. Expressing @ in terms of yu by 
means of (107.2) and substituting in (107.1), we have 


pes (* &) = —4nm'Go. (107.3) 


As the mass of the gravitating body increases, so of course does its mean 
density, as the following calculations will confirm. When the total mass 4 
of the body is sufficiently large, therefore, we can, as shown in§ 106, regard 
the substance as a degenerate electron Fermi gas, initially non-relativistic 
and then at still greater masses relativistic. 

The chemical potential of a non-relativistic degenerate electron gas is 
related to the density o of the body by 


3772 2/3 h2 
4 OF 


5) mms 0/8 > (107.4) 


see formula (57.3), with @ = m’N/V (m’ is the mass per electron and m, the 
electron mass). Expressing 9 in terms of u and substituting in (107.3), we 


havet 
7/2 y493/2 yap! 2 
oe (eS) = 2, eax 2 eet, (107.5) 
} 


ar ar a 


t It is easy to see that, for an electrically neutral gas consisting of electrons and 
atomic nuclei, the equilibrium condition can be written in the form (107.2) witih 
the electron chemical potential as 4 and the mass per electron as m’. For the deri- 
vation of this equilibrium condition (§ 25) involves considering the transport of an 
infinitesimal amount of substance from one place to another. In a gas consisting 
of both positively and negatively charged particles, such transport must be regard- 
ed as that of a certain quantity of neutral matter (i.e. electrons and nuclei togeth- 
er). The separation of the positive and negative charges is energetically very un- 
favourable, because of the resulting very large electric fields. We therefore obtain 
the equilibrium condition in the form 


HauctZ beat MguctZmayp = 0 


(with Z electrons per nucleus). Owing to the large mass of the nuclei (compared 
with that of the electrons) their chemical potential is very small compared with 1. 
Neglecting 4,y, and dividing the equation by Z, we obtain 


Met’ = 0. 


As in § 106, it will be assumed in the numerical estimates that m’ is twice the 
nucleon mass (7n’ = 2m,). 
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The physically meaningful solutions of this equation must not have singular- 
ities at the origin: « — constant for r—~ 0. This requirement necessarily 
imposes on the first derivative the condition 


du/dr=0 for r=0, (107.6) 


as follows immediately from equation (107.5) after integration over r: 
Af aner 
3 

A number of important results can be derived from equation (107.5) by 
simple dimensional considerations. The solution of (107.5) contains only two 
constants, 4 and (for instance) the radius R of the body, a knowledge of 
which uniquely defines the solution. From these two quantities we can form 
only one quantity with the dimensions of length, the radius R itself, and one 
with the dimensions of energy, 1/A?R* (the constant A having dimensions 
cm? erg"), It is therefore clear that the function u(r) must have the form 


ar) = aa! (x) (107.7) 


where f is some function of the dimensionless ratio r/R only. Since the den- 
sity @ is proportional to y®”, the density distribution must be of the form 


constant r 


Thus, when the size of the sphere varies, the density distribution in it 
remains similar in form, the density at corresponding points being inversely 
proportional to R°. In particular, the mean density of the sphere is inversely 


proportional to R°: 
@ oc 1/R®. 


The total mass M of the body is therefore inversely proportional to the cube 


of the radius: 
M co 1/R'. 


These two relations may also be written 
Ro M-¥, Go MM. (107.8) 


Thus the dimensions of an equilibrium sphere are inversely proportional to 
the cube root of its total mass, and the mean density is proportional to the 
square of the mass. The latter result confirms the assumption made above 
that the density of a gravitating body increases as its mass increases. 
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The fact that a gravitating sphere of non-relativistic degenerate Fermi gas 
can be in equilibrium for any total mass M can be seen a priori from the fol- 
lowing qualitative argument. The total kinetic energy of the particles in such 
a gas is proportional to N(N/V)** (see (57.6)), or, what is the same thing, 
to M*5"/R*, and the gravitational energy of the gas as a whole is negative and 
proportional to M?/R. The sum of two such expressions can have a minimum 
(as a function of R) for any M, and at the minimum R o M7", 

Substituting (107.7) in (107.5) and using the dimensionless variable = 
r/R, we find that the function f(£) satisfies the equation 


1d df\ _ 
aE (: #) = —f3n (107.9) 
with the boundary conditions f’(0) = 0, f(1) = 0. This equation cannot be 
solved analytically, and must be integrated numerically. It may be mentioned 
that as a result we find f(0) = 178.2, f’(1) = — 132.4. 

Using these numerical values it is easy to determine the value of the con- 
stant MR. Multiplying equation (107.1) by r? dr and integrating from 0 to R, 
we obtain 


GM = R*[d¢/dr],-r = —(R?/m’) [(du/dr),er = —f'(1)/m'2?R', 


whence 
MR = 91.9h8/G3m3m'5 = 2.2 10 (m,/m')® © km’, (107.10) 


where © = 2X10" is the Sun’s mass. Finally, the ratio of the central | 
density o(0) to the mean density @ = 3/M/4xR’ is easily found to be 


0(0)/o = —f9(0)/3f’(1) = 5.99. (107.11) 


Curve 1 in Fig. 50 (p. 324) shows the ratio 0(r)/e(0) as a function of r/R." 

Let us now examine the equilibrium of a sphere consisting of a degenerate 
extreme-relativistic electron gas. The total kinetic energy of the particles of 
such a gas is proportional to N(N/V)'* (see (61.3)), and hence to M“/R; the 
gravitational energy is proportional to —M?/R. Thus the two quantities de- 
pend on R in the same manner, and their sum will also be of the form con- 
stant/R. It follows that the body cannot be in equilibrium: if the constant is 
positive, the body will tend to expand until the gas becomes non-relativistic; 
if the constant is negative, a decrease of R to zero corresponds to a decreased 
total energy, i.e. the body will contract without limit. The body can be in 


In § 106 we have seen that matter may be regarded as a non-relativistic degener- 
ate electron gas at densities 9 => 20Z? g/cm’. If this inequality is satisfied for the 
mean density of the sphere considered, its mass must satisfy the condition M >> 
5x 107-8 ZQ. The corresponding radii are less than 5X 104 Z~¥3 km. 
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equilibrium only in the special case where the constant is zero, and the equi- 
librium is then neutral, the value of R being arbitrary. 

This qualitative argument is, of course, entirely confirmed by exact quanti- 
tative analysis. The chemical potential of the relativistic gas considered is 
related to the density by 


Lb = (32*)4 he(o/m’y'8 (107.12) 
(see (61.2)). Instead of (107.5) we now have 


1--d du 4Gm” 
ee eee 2 — =>=_—_ 3 — re ° 
( aut, = eae (107.13) 


Since 4 now has dimensions erg~? cm~2, we find that the chemical! potential 
as a function of r must be of the form 


1 r 
MO = ey A(z) (107.14) 
and the density distribution 
constant r 
a = Se F(R): 


Thus the mean density is inversely proportional to R?, and the total mass 
M co R°a is independent of R: 


60 1/R*, M=constant = Mp. (107.15) 
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Mp is the only value of the mass for which equilibrium is possible: for M > 
Mo the body will tend to contract indefinitely, and for M < M, it will ex- 
pand. 

For an exact calculation of the “critical mass” Mp, it is necessary to inte- 
grate numerically the equation 


1 d /,, df\ _ iat: 7 
we ( <) =-f, fM=0, f)=0, — (107.16) 
which is satisfied by the function f(é) in (107.14). The result is f(0) = 6.897, 
fC) = —2.018. For the total mass we find 


GMo = R'[dd/drl,-k = —f'()/m’ V2, 


whence 


3.1 (he 
Mo= Fa (G 


\" — 5,8(m,/m’? ©. (107.17) 


Putting m’ = 2m,, we find Mp = 1.45 ©. Finally, the ratio of the central 
density to the mean density is 0(0)/6 = —f°(0)/3f’(1) = 54.2. Curve 2 in 
Fig. 50 shows o(r)/0(0) in the extreme relativistic case as a function of 7/R.* 

The results obtained above concerning the relation between the mass and 
the radius of a “cold” spherical body in equilibrium can be represented by a 
single relation M = M(R) for all radii R. For large R (and therefore for 
small densities), the electron gas may be regarded as non-relativistic, and the 
function M(R) decreases as 1/R°. When R is sufficiently small, however, the 
density is so large that we have the extreme relativistic case, and the function 
M(R) is almost a constant Mp; strictly M(R) ~ Mo when R — O. Figure 51 
shows the curve M = M(R) calculated with m’ = 2m,.} It should be noted 
that the limiting value 1.45 © is reached only very gradually; this is because 
the density decreases rapidly away from the centre of the body, and so the 
extreme relativistic case may hold near the centre while the gas remains non- 
relativistic in a considerable part of the volume of the body. We may also 
mention that the initial part of the curve (R small) has no real physical 
significance: at sufficiently small radii the density becomes so large that nu- 
clear reactions begin to occur. The pressure will then increase with density less. 


t The formal problem of the equilibrium of a gravitating gaseous sphere with a 
power-law dependence of P on 0 was investigated by R. Emden (1907). The phys- 
ical deduction of the existence of the limiting mass and its value (107.17) was 
made by S. Chandrasekhar (1931) and L. Landau (1932). 

t The intermediate part of the curve is constructed by numerical integration of 
equation (107.3) with the exact relativistic equation of state for a degenerate gas; 
see § 61, Problem 3. 
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rapidly than o“*, and for such an equation of state no equilibrium is pos- 
sible.t 

Finally, this curve also has no meaning for large values of R (and small A): 
as has already been mentioned (see the second footnote to this section), in 
this range the equation of state used above becomes invalid. Here it should be 
pointed out that there is an upper limit to the possible size of a “cold” body, 


Fic. 51 


since on the curve in Fig. 51 large dimensions of the body correspond to small 
masses and small densities, but when the density is sufficiently small the sub- 
stance will be in the ordinary “atomic” state and will be solid at the low tem- 
peratures here considered. The dimensions of a body consisting of such a 
substance will obviously decrease as its mass decreases further, and not 
increase as shown in Fig. 51. The true curve R = R(M) must therefore have 
a maximum for some value of M. 

The order of magnitude of the maximum radius can easily be determined 
by noting that it must correspond to the density at which the interaction 
between electrons and nuclei becomes important, i.e. for @ ~ (m,e2/h)8 m’Z? 
(see 106.1)). Combining this with equation (107.10), we obtain 


Rinax ~ 42/G2emem’Z48 ~ 105m, /m’Z¥8 kin. (107.18) 


t If the chemical potential is proportional to a power of the density, 4 oC @ 
(andso P o o"*}), the internal energy of the body is proportional to Vo"*}, i.e. 
to M"+1/R"; the gravitational energy is again proportional to —M7?/R. It is easy 
to see that for n < -} the sum of two such expressions has an extremum as a func- 
tion of R, but this extremum is a maximum, not a minimum. 
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§ 108. The energy of a gravitating body . 
The gravitational potential energy E,. of a body is given by the integral 
Eg = +f e¢ av, (108.1) 


taken over the whole volume of the body. It will, however, be more conven- 
ient for us to start from a different expression for this quantity, which may 
be found as follows. Let us imagine the body to be gradually “built up” 
from material brought from infinity. Let M(r) be the mass of substance within 
a sphere of radius r. Let us suppose that a mass M(r) with a certain value of r 
has already been brought from infinity; then the work required to add a fur- 
ther mass dM(r) is equal to the potential energy of that mass (in the form of 
a spherical shell of radius r and thickness dr) in the field of the mass M(r), 
i.e. -GM(r) dM(r)/r. The total gravitational energy of a sphere of radius R 
is therefore 


Reve sg { ——— (108.2) 


Differentiation of the equilibrium condition (107.2) gives 


the differentiation must be at constant temperature, and (0u/0P), = vis the 
volume per particle. The derivative — d@/dr is the force of attraction on unit 
mass at a distance r from the centre, and equals —GM(r)/r?. Using also the 
density 9 = m’/v, we have 

dP GM(r) 


i 
Fare a (108.3) 


Expressing GM(r)/r from this in terms of dP/dr and writing dM(r) = 
o(r)-4ar? dr, we can put (108.2) in the form 


and then integrate by parts (bearing in mind that at the boundary of the 
body P(R) = 0 and that r3P + 0 as r — 0): 


R 
Eg =—12n [ PP dr =—3 [ Pdv. (108.4) 
0 
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Thus the gravitational energy of an equilibrium body can be expressed as an 
integral of the pressure over the volume. 

Let us apply this formula to the degenerate Fermi gases considered in§ 107. 
We make the calculation for the general case, and take the chemical potential 
of the substance to be proportional to some power of its density: 


p= Kel", (108.5) 
Since du = v dP = (m’/0) dP, we have 
P= — % guns, (108.6) - 


In the equilibrium condition u/m’ + = constant, the constant is just the 
potential at the boundary of the body, where u vanishes; this potential is 
—GM/R (M = M(R) being the total mass of the body), and so we can 
write 


We substitute this expression in the integral (108.1) which gives the gravi- 
tational energy, and use formulae (108.5), (108.6), obtaining 


fate Se [ uoar SR eay = "5 (pav-SE 


2m’ 2R 2 2R 


Finally, expressing the integral on the right in terms of E,, by (108.4), we 
have 
3 GM 


Eu —3—a R 


(108.7) 
Thus the gravitational energy of the body can be expressed by a simple for- 
mula in terms of its total mass and its radius. 

A similar formula can also be obtained for the internal energy E of the 
body. The internal energy per particle is u— Pv (for zero temperature and 
entropy); the energy per unit volume is therefore 

1 _ Op 
where (108.5) and (108.6) have been used to derive the second equation. 
The internal energy of the whole body is therefore 


1 n GM 
Ban{ Pay =—3nkq = a ae (108.8) 
Finally, the total energy of the body is 
Exot = E+Egr = — oe (108.9) 


5—n R 
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For a non-relativistic degenerate gas n = 2, and sot 


6 Git 3 GM 3 GM 
Ege — 7 ~R_ ’ a —R- r Etot = ae “R- . (108.10) 
In the extreme relativistic case, n = 3, so that 
‘Me? 
E,=—-E = -$ = Ext = 0. (108.11) 


The total energy is zero in this case, in accordance with the qualitative argu- 
ments given in§ 107 concerning the equilibrium of such a} 1dy.+ 


§ 109. Equilibrium of a neutron sphere 


For a body of large mass there are two possible equilibrium states. One cor- 
tesponds to the state of matter consisting of electrons and nuclei, as assumed 
in the numerical estimates in § 107. The other corresponds to the “neutron” 
state of matter, in which almost all the electrons have been captured by pro- 
tons and the substance may be regarded as a neutron gas. When the body is 
sufficiently massive, the second possibility must always become thermody- 
namically more favourable than the first (W. Baade and F. Zwicky, 1934). 
Although the transformation of nuclei and electrons into free neutrons in- 
volves a considerable expenditure of energy, when the total mass of the body 
is sufficiently great this is more than counterbalanced by the release of grav- 
itational energy owing to the decrease in size and increase in density of the 
body. 

First of all, let us examine the conditions in which the neutron state of a 
body can correspond to any thermodynamic equilibrium (which may be 
metastable). To do this, we start from the equilibrium condition u+m,¢ = 
constant, where yu is the chemical potential (the thermodynamic potential per 
neutron), m, the neutron mass, and ¢ the gravitational potential. 

Since the pressure must be zero at the boundary of the body, it is clear that 
in an outer layer of the body the substance will be at low pressure and 


t In this case 2E = —E,,, in agreement with the virial theorem of mechanics, 
applied to a system of particles interacting according to Newton’s law; see Mecha- 
nics,§ 10. 

+ In order to avoid misunderstanding, it may be mentioned that the energy F 
{and therefore E,,, in (108.11)) includes the rest energy of the particles (which pro- 
duce the pressure P). If E,,, is defined as the ‘‘binding energy” of the body (with 
the energy of the matter dissipated through space taken as zero), the rest energy 
of the particles must be subtracted from it. 
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density and will therefore consist of electrons and nuclei. Although the 
thickness of this “shell” may be comparable with the radius of the dense 
inner neutron “core”, the density of the outer layer is much lower, and so its 
total mass may be regarded as small compared with the mass of the core. 

Let us compare the values of 4 +-m,@ at two points: in the dense core near 
its boundary and near the outer boundary of the shell. The gravitational 
potential at these points may be taken as —GM/R and —GM/R’, where 
Rand R’ are the core and shell radii, and M the mass of the core, which in the 
approximation used here is equal to the total mass of the body. The chemical 
potential at both points is determined mainly by the internal energy (bind- 
ing energy) of the corresponding particles, which is large compared with their 
thermal energy. The difference between the two chemical potentials may there- 
fore be taken as simply equal to the difference between the rest energy of a 
neutral atom (i.e. a nucleus and Z electrons) per unit atomic weight and the 
rest energy of the neutron; let this quantity be denoted by 4. Then, equating 
the values of 4-+m,@ at the two points considered, we have 

1 1 
mG @) afi. 

Hence, whatever the radius R’, the mass and radius of the neutron core must 


satisfy the inequality 
m,MG/R > A. (109.1) 


Applying the results of § 107 to a spherical body consisting of a degenerate 
(non-relativistic) neutron gas, we find that M and R are related by 


MR® = 91.948/G3m8 = 3.6X 10° © km! (109.2) 


(formula (107.10) with m, and m’ replaced by m,). Hence expressing M in 
terms of R and substituting in (109.1), we obtain an inequality for M, which 
in numerical form is M > ~ 0.2©. For example, with 4 for oxygen we get 
M > 0.170, and for iron M > 0.18©. These masses correspond to radii 
R < 26km.t 

This inequality gives a lower limit of mass, beyond which the neutron 
state of the body cannot be stable. It does not, however, ensure complete 
stability; the state may be metastable. To determine the limit of metastabil- 
ity, we must compare the total energies of the body in two states: the neu- 
tron state and the electron—nucleus state. The conversion of the whole mass M 


t There is, of course, no sharp boundary between the “core” and the bell and 
the transition between them is continuous. 

t It must be emphasised that no literal astrophysical significance is to be atiabhed 
to the numerical estimates in this section, which are based on simple assumptions 
about the structure of the body. . 
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from the electron-nucleus state to the neutron state requires an expenditure 
of energy MA/m, to counterbalance the binding energy of the nuclei. In the 
process, energy is released because of the contraction of the body; according 
to formula (108.10), this gain of energy is 

3GM? / 1 1 

7 (Re) 
where R, is the radius of the body in the neutron state, given by formula 
(109.2), and R, its radius in the electron—nucleus state, given by (107.10). 
Since R, >> R,, the quantity 1/R, may be neglected, and we obtain the follow- 
ing sufficient condition for complete stability of the neutron state of the 
body (omitting the suffix in R,): 


3GMm,/7R > A. (109.3) 


Comparing this condition with (109.1) and using (109.2), we see that the 
lower limit of mass determined by the inequality (109.3) is greater by a factor 
(7/3)°4 = 1.89 than that given by (109.2). Numerically, the limit of metasta- 
bility of the neutron state is therefore at a mass M = 4© (and radius 
R = 22km).* 

Let us now consider the upper limit of the range of mass values for which 
a neutron body can be in equilibrium. If we were to use the results of § 107 
(formula (107.17), with m, in place of m’), the value obtained for this limit 
would be 6 ©. Inreality, however, these results are not applicable here, for 
the following reason. In a relativistic neutron gas, the kinetic energy of the 
particles is of the order of, or greater than, the rest energy, and the gravi- 
tational potential ¢ ~ c?.+ In consequence it is no longer valid to use the 
Newtonian gravitational theory, and the calculations must be based on the 
general theory of relativity; and, as we shall see later, we find that the extreme 
relativistic case is no longer reached. The calculations must therefore make 
use of the exact equation of state of a degenerate Fermi gas; see§ 61, Prob- 
lem 3. 

The calculations are effected by numerical integration of the equations of 
a spherically symmetric static gravitational field, and the results are as fol- 
lows.! 


t The mean density of the body is then 1.4X 10 g/cm’, and so the neutron gas 
may in fact still be regarded as non-relativistic, and the formulae used here are 
still valid. 

t In the relativistic electron gas, the kinetic energy of the particles is comparable 
with the rest energy of the electrons, but is still smal! in comparison with the rest 
energy of the nuclei, which contribute most of the mass of the substance. 

$ The details of the calculations are given in the original paper by J. R. Oppen- 
heimer and G. M. Volkoff, Physical Review 55, 374, 1939, 
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The limiting mass of a neutron sphere in equilibrium is found to be only 
Monax = 9-760, and this value is reached at a finite radius R,i, = 9.4 km. 
Figure 52 shows a graph of the relation obtained between the mass M and 
the radius R. Stable neutron spheres of larger mass or smaller radius cannot, 
therefore, exist. It should be mentioned that the mass M here denotes the 
product M = Nm,, where N is the total number of particles (i.e. neutrons) 
in the sphere. This quantity is not equal to the gravitational mass M,, of the 
body, which determines the gravitational field created by it in the surround- 
ing space. Because of the “gravitational mass defect”, in stable states we 
always have M,, < M (in particular, for R = Ruins Mg, = 0.95M).' 


Fic. 52 


The question arises of the behaviour of a spherical body of mass exceeding 
M.,,,- It is clear a priori that such a body must tend to contract indefinitely. 


max* 
The nature of this unrestrained gravitational collapse has been described in 


Fields, §§ 102-104. 

It should be noted that the possibility in principle of gravitational collapse, 
which (for the model considered of a spherical body) is unavoidable for 
M > M.,ax is not in fact restricted to large masses. A “collapsing” state 
exists for any mass, but for M < M,,,, it is separated by a very high energy 
barrier from the static equilibrium state.t 


t The point R = Ryn in Fig. 52 isin fact a maximum on the curve M = M(R). 
This curve continues beyond the maximum as an inward spiral which asymptot- 
ically approaches a centre. The parameter which increases monotonically along 
the curve is the density at the centre of the sphere, which tends to infinity for a 
sphere corresponding to the limiting point of the spiral(N. A. Dmitriev and S. A. 
Kholin, 1963). However, no part of the curve for R < Ry, corresponds to a stable 
state of the sphere. For an account of the investigationsee N. A. Dmitrievand S. A. 
Kholin, Voprosy kosmogonii 9, 254, 1963; B. K. Harrison, K. S. Thorne, 
M. Wakano and J. A. Wheeler, Gravitation Theory and Gravitational Collapse, 
University of Chicago Press, Chicago, 1965. 

t See Ya. B. Zel’dovich, Soviet Physics JETP 15, 446, 1962. 


CHAPTER XII 


FLUCTUATIONS 


§ 110. The Gaussian distribution 


IT HAS already been stressed several times that the physical quantities which 
describe a macroscopic body in equilibrium are, almost always, very nearly 
equal to their mean values. Nevertheless, deviations from the mean values, 
though small, do occur (quantities are said to fluctuate), and the problem 
arises of finding the probability distribution of these deviations. 

Let us consider some closed system, and let x be some physical quantity 
describing the system as a whole or some part of it (in the former case x must 
not, of course, be a quantity which is strictly constant for a closed system, 
such as its energy). In what follows it will be convenient to suppose that the 
mean value x has already been subtracted from x, and so we shall everywhere 
assume that ¥ = 0. 

The discussion in§ 7 has shown that, if the entropy of a system is formally 
regarded as a function of the exact values of the energies of the subsystems, 
the function e% will give the probability distribution for these energies (for- 
mula (7.17)). It is easy to see, however, that the discussion made no use of any 
specific properties of the energy. Similar arguments will therefore show that 
the probability for a quantity x to have a value in the interval from x to 
x+dx is proportional to e°™, where S(x) is the entropy formally regarded 
as a function of the exact value of x. Denoting this probability by w(x) dx, 
we have! 


w(x) = constantx eS@), (110.1) 


Before proceeding to examine the consequences of this formula, let us con- 
sider its range of applicability. All the arguments leading to formula (110.1) 
tacitly assume that the quantity x behaves classically.t We must therefore 
find a condition which ensures that quantum effects are negligible. 


t This formula was first applied to the study of fluctuations by A. Einstein 
(1907). 

t This does not mean, of course, that the whole system must be a classical one. 
Variables other than x pertaining to the system may have quantum behaviour. 
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As we know from quantum mechanics, the relation AE Ax ~ Ax exists 
between the quantum uncertainties of energy and of some quantity x, x being 
the classical rate of change of x (see Quantum Mechanics, § 16). 

Let t be a time’ expressing the rate of change of the quantity x which is 
considered here and which has a non-equilibrium value; then x ~ x/t, and 
so AE Ax ~ hx/v. Itisclear that the quantity x may be said to have a definite 
value only if its quantum uncertainty is small: 4x « x, whence AE > h/t. 
Thus the quantum uncertainty of energy must be large in comparison with 
A/t. The entropy of the system will then have uncertainty AS >> A/T. 

If formula (110.1) is to be meaningful, it is clearly necessary for the uncer- 
tainty of entropy to be small compared with unity: 


Tht, 1 KIT. (110.2) 


This is the required condition. When the temperature is too low or when the 
quantity x varies too rapidly (zx is too small) the fluctuations cannot be treated 
thermodynamically, and the purely quantum fluctuations become of major 
importance. 

Let us now return to formula (110.1). The entropy S has a maximum for 
x = ¥ = 0. Hence 0S/dx = 0 and 02S/dx? < 0 for x = 0. In fluctuations 
the quantity x is very small. Expanding S(x) in powers of x and retaining 
only terms of up to the second order, we obtain 


S(x) = S(0)—48, (110.3) 


where f is a positive constant. Substitution in (110.1) gives the probability 
distribution in the form 


w(x) dx = Ae—** dx. 


The normalisation constant A is given by the condition f w(x) dx = 1; 
although the expression for w(x) is valid for small x, the integrand de- 
creases so rapidly with increasing |x| that the range of integration may be 
extended from — © to co. The integration gives A = +/(8/2n). 

Thus the probability distribution of the various values of the fluctuation 
x is given by the formula 


w(x) dx = 4/(B/2m)e—*6* dx. (110.4) 


t The time t need not be the same as the relaxation time for equilibrium to be 
reached with respect to x, and may be less than this time if x approaches ¥ in an 
oscillatory manner. For example, if we consider the variation of pressure in a 
small region of the body (with linear dimensions ~ a), t will be of the order of the 
period of acoustic vibrations with wavelength 4 ~ a, i.e. t ~ a/c, wherec is the 
velocity of sound. 
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Such a distribution is called a Gaussian distribution. It has a maximum when 
x = Oand decreases rapidly and symmetrically as |x| increases on either side 
of the maximum. 

The mean square fluctuation is 


xe | x*w(x) dx = 5: (110.5) 


Thus we can write the Gaussian distribution in the form 
w(x) dx = ee ex ae dx. (110.6) 
= sreansy °° (~ zea) 3 | 


As we should expect, the’ smaller (x*), the sharper is the maximum of w(x). 
Knowing the mean square (x?), we can find the corresponding quantity 
for any function $(x). Since x is small, we have? 


((4)*) = [(db/dx)"]xm0 (x*). (110.7) 


§ 111. The Gaussian distribution for more than one variable 


In§ 110 we have discussed the probability of a deviation of any one thermo- 
dynamic quantity from its mean value, disregarding the values of other 
quantities.t In a similar manner we can determine the probability of a si- 
multaneous deviation of several thermodynamic quantities from their mean 
values. Let these deviations be denoted by xj, Xp, ..-, X,- 

We define the entropy S(x,, ..., x,) as a function of the quantities x,, 
X_, ..., X, and write the probability distribution in the form w dx, ... dx,, 
with w given by (110.1). Let S be expanded in powers of the x,; as far as the 
second-order terms, the difference S—So is a negative-definite quadratic 
form: 


n 
S—So=—} ) x 
0 ay 4 Bik Xk 


(clearly B,, = B,,). In the rest of this section we shall omit the summation 
sign, and summation from 1 to n over all repeated suffixes will be implied. 


Thus we write 
S—So = — FB iXiXn- (111.1) 


t It is, of course, implied that the function @(x) does not vary greatly for values 
of x ~ (x*)'”, and that the derivative dp/dx is not zero at x = 0. 


t This means that the function S(x) used in§ 110 was the greatest possible value 
of the entropy for the given non-equilibrium value of x. 


23 


336 Fluctuations 


Substituting this expression in (110.1), we obtain for the required probability 
distribution 
w= Aether, (111.2) 


The constant A is determined from the normalisation condition 
fwdx1... dx, = 1. 


Here, for the same reason as in § 110, the integration over each x, can be 
taken from — © to . To calculate this integral we proceed as follows. The 


linear transformation 
Xp = AipX; (111.3) 


Of x1, Xo) ..., X, converts the quadratic form 6,.x,x, into a sum of squares 
x;x;. In order that 
BieX Xe = XiX = XiXKOin 
should be valid, the transformation coefficients must satisfy the relations 
Bir@idem = O1m- (111.4) 
The determinant of the matrix of quantities on the left of this equation is the 


product of the determinant 8 = |8,| and two determinants a = |a,|. The 
determinant | 6,,| = 1. The above relation therefore shows that 


Ba? = 1. (111.5) 
The Jacobian of the linear transformation from the variables x, to the x, is 
a constant, the determinant a. After the transformation, therefore, the nor- 


malisation integral separates into a product of 7 identical integrals and, 
using (111.5), we find 


a [ew (->") al = Jy ene = |, 


m_ OO 


Thus we obtain finally the Gaussian distribution for more than one vari- 
able in the form 


w= ie exp (—zPuxm) (111.6) 


Let us define the quantities 
X; = —0S/0x; = Birxe, (111.7) 
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which we refer to as thermodynamically conjugate to the x,;' and let us 
determine the mean values of the products x,X,: 


_ vB ( [ _i 

(xiXK) = nye | oe) XP xix EXP ( 3 Bux] dx1... dx,. 
To calculate the integral, let us assume for the moment that the mean 
values x, are not zero but X,9, say. Then in (111.6) x, must be replaced by 
X;—Xjo, and the definition of the mean gives 


x= ae [ sine [x exp | 5 Ba(xi—x0) (m1) dx;... dx, = X. 


v v ~ 


Differentiating this equation with respect to x, 9 and then again putting all 
the x9 equal to zero, we have 6, on the right and the required integral on 
the left. 


Thus we find 
(xX %) = On (111.8) 


Substituting (111.7) gives B,<x;%,) = 5%, whence 
(Xin) = Bes (111.9) 


where £~}, is an element of the matrix inverse to £,,. 
Lastly, let us determine (X,X,). According to (111.7) and (111.8), 


(XX) = BikxiX~) = BiOms 
(XiXk) = Bir- (111.10) 


i.e. 


It is easy to determine also the mean square fluctuation of any function 
G(x, -.., X,)- Since the deviations from the mean values are small, 4¢ = 
(Op/0x,) 4x, where the 0¢/0x, denote the values of the derivatives for 
xXy= ... = x, = 0. Hence 

Op O¢ op O¢ 


(4g) = Ox; Bx, re ~ Bx, Ox, ° (111.11) 


If the fluctuations of any two of the x, (x, and x2, say) are statistically 
independent, the mean value (x1x2) is equal to the product of the mean 


t Note that, with the linear relation (111.7), the conjugacy is reciprocal: if the 
same entropy is expressed in terms of the quantities X;, then 


x; =—dS/0X;, (111.7a) 
since, from (111.7), 


dS = —X, dx, = —Byix, dx, = — x, AB yx) = —x, Xj. 
230 
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values x; and X33 ne each of these is zero, so is (x1%2), and from (111.9) 
this implies that Bj, = 0. It is easy to see that for a Gaussian distribution 
the converse theorem is also valid: if (x,x.) = 0 (i.e. B7}, = 0), the fluc- 
tuations of x, and xq are statistically independent. For the probability 
distribution wi. of the quantities x; and x2 is obtained by integrating the 
distribution (111.6) over all the other x,; the result is an expression of the 
form 


12 = constant X exp { — Bi x4— BigX1%_— 5B ogX3} 


(where the coefficients B,, are in general different from the es 
Bu.) Applying formula (111.9) to this distribution, we find that(x,x,) = 

If (x:%2) = 0, then 6’~}, = 0. But for a matrix of order two the ue 
of the inverse matrix element #’~1, implies that of the element 6;..1 Thus 
W12 separates into a product of two independent Gaussian distributions for 
the quantities x; and x2, which are therefore statistically independent. 


PROBLEM 


Determine the mean value (exp («,x,)), where the «, are constants and the ~x; 
are fluctuating quantities subject to the Gaussian distribution (111.2). 


SOLUTION. We have to calculate the integral 
Cexp (a,x) = 4 f exp (xx;— 2B uxi%n) 1X1 «+» Ap. 
By the transformation (111.3) the exponent in the integrand becomes 
1-3 BuXirn = Oy Xi — ZAIN 
= xp UP + eA» 
and the integration then gives 
exp (ax) = exp (3042004) 


According to (111.4), ay, = a7j,,87}, and so a,.ay, = B~}. Thus, using (111.9), we 
have finally 
Cexp (a;x,)) = exp (Faj044%)%4))- 


§ 112. Fluctuations of the fundamental thermodynamic quantities 


Let us now calculate the mean square fluctuations of the fundamental 
thermodynamic quantities, pertaining to any small part of a body. This small 
part must still, of course, contain a sufficient number of particles. At very 


t For a matrix of order two we have 


“ia = Bie/(Gin—BrsB 29). 
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low temperatures, however, this condition may be weaker than (110.2), 
which ensures that quantum fluctuations afte absent, as assumed; in this 
case the minimum permissible dimensions of the parts of the body will be 
determined by the latter condition. To avoid misunderstanding, it should 
be emphasised that the degree of importance of quantum fluctuations has no 
bearing on the influence of quantum effects on the thermodynamic quanti- 
ties (or equation of state) of the substance: the fluctuations may be classical 
while at the same time the equation of state is given by the formulae of quan- 
tum mechanics. 

For quantities such as energy and volume, which have a purely mechanical 
significance as well as a thermodynamic one, the concept of fluctuations is 
self-explanatory, but it needs more precise treatment for quantities such as 
entropy and temperature, whose definition necessarily involves considering 
the body over finite intervals of time. For example, let S(E, V) be the equi- 
librium entropy of the body as a function of its (mean) energy and (mean) 
volume. By the fluctuation of entropy we shall mean the change in the func- 
tion S(E, V), formally regarded as a function of the exact (fluctuating) values 
of the energy and volume. : 

As we have seen in the preceding sections, the probability w of a fluctua- 
tion is proportional to e*, where S, is the total entropy of a closed system, 
i.e. of the body as a whole. We can equally well say that w is proportional 
to e“*', where JS, is the change in entropy in the fluctuation. According to 
(20.8) we have 4S, = —R,i,/To, where R,,i, is the minimum work needed 
to carry out reversibly the given change in the thermodynamic quantities 
in the small part considered (relative to which the remainder of the body 


acts as a medium). Thus 
w oc e~Rmin/To , (112.1) 


Here we substitute for R,,;, the expression 


where AE, AS, AV are the changes in the energy, entropy and volume of the 
small part of the body in the fluctuation, and J, Po the temperature and pres- 
sure of the medium, i.e. the equilibrium (mean) values of the temperature 
and pressure of the body. In what follows we shall omit the suffix zero from 
all quantities which are the coefficients of fluctuations; the equilibrium 
values will always be meant. Thus we have 


eer ee 


w oc exp ( _ T (112.2) 


Y For example, for pressure fluctuations the condition t >» #/T with t ~ a le 
(see the third footnote to § 110) gives a>» fic/T. 
« ¢ “7 ie ce a ue 
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We may note that in this form the expression (112.2) is applicable to any 
fluctuations, small or large (a large fluctuation here meaning one in which, 
for example, AE is comparable with the energy of the small part of the body, 
though of course still small compared with the energy of the whole body). 
The application of formula (112.2) to small fluctuations (which are those 
generally occurring) leads to the following results. 

Expanding AE in series, we obtain (cf. § a 


‘sass l 2 Bp 2 
AE-TAS +P AV = 2 ase (As¥ 4202 a 7 As Av + 2= (av) |. 
It is easily seen that this expression may be written as 


OE i 
x [454(S5) +4vA(sr) | = 5 (AS AT—AP AY). 


Thus we obtain the fluctuation probability (112.2) in the form 


w oc exp (a ; (1 12.3) 


From this general formula we can find the fluctuations of various thermo- 
dynamic quantities. Let us first take V and T as independent variables. Then 


_ [as as C, aP 
4s = (sr) ar+(ar) Av = & 7 4t+ (Sr) AV, 


oP . 
4P = (ap) AT + (sr), AV; 


see (16.3). Substituting these expressions in the exponent in formula (112.3), 
we find that the terms in AV AT cancel, leaving 


We exp | ~a7 40) + zp — (aw ) (av. (112.4) 


This expression separates into two factors, one depending only on AT and 
the other only on AV. In other words, the fluctuations of temperature and 
of volume are statistically independent, so that 


(AT AV) = 0. (112. 5) 
Comparing successively each of the two factor of (112.4) with the general 


formula (110.6) for the Gaussian distribution, we find the following expres- 
sions for the mean square fluctuations of temperature’ and volume: 


(AT) = T4Cy, (112.6) 
(AV)) =—T(OV/OP)r. (112.7) 


t If T is measured in degrees, ((AT)*) = kT?/C,,. 
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These quantities are positive by virtue of the thermodynamic inequalities 
C, > 0 and (OP/dV), < 0. 
Let us now take P and S as the independent variables in (112.3). Then 


ov 
AV = (s), AP+(s5) AS, 


ar Or 
AT = (as), As+ (sp) AP =~ = 48+ (ap). AP. 


From the formula dW = TdS+V dP, 
(OV /0S)p = 0?'W/OP OS = (OT/OP)s, 


AV = (a). AP+(sp). AS. 
Substitution of AV and AT in (112.3) gives 
w oc exp {or a (; oP), (4PP—sR- z, (4sy (112.8) 


As in (112.4), this expression is a product of factors, one depending only 
on 4P and the other only on AS. In other words, the fluctuations of entropy 
and of pressure are statistically independent,’ so that 


and so 


(AS AP) = 0. (112.9) 

For the mean square fluctuations of entropy and pressure we find 
(AS}) = Cp, (112.10) 
(4P)*) = —T(OP/OV)s. (112.11) 


These formulae show that the mean square fluctuations of additive thermo- 
dynamic quantities (volume and entropy) are proportional to the dimensions 
(the volume) of those parts of the body to which they relate. Accordingly the 
root-mean-square fluctuations of these quantities are proportional to the 
square root of the volume, and the relative fluctuations are inversely propor- 
tional to this square root, in agreement with the general results in § 2 (for- 
mula (2.5)). But for quantities such as temperature and pressure the r.m.s. 
fluctuations themselves are inversely proportional to the square root of 
the volume. 

Formula (112.7) determines the fluctuation of the volume of some part of 
the body, containing a certain number N of particles. Dividing both sides by 


* The statistical independence of the pairs of quantities 7, V and S, P is already 
obvious from the following argument. If we take x, = AS and x, = AV as the x; 

in the formulae in § 111, the corresponding YX, will be ¥, = AT/T, X, = —AP/T 
(see § 22). But (x;X,) = 0 for i * k according to the general formula (111.8), and 
this gives (112.5) and (112.9). 
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N?, we find the volume fluctuation per particle: 
T (OV 
A(VIN))*) = —x> (ap) , (112.12) 
«( Y) = — Ne lap e 


This fluctuation must obviously be independent of whether we consider it for 
constant volume or for constant number of particles. From (112.12) we can 
therefore find the fluctuation of the number of particles in a fixed volume in 
the body. Since V is then constant, we must put A(V/N) = V A(1/N) = 
—(V/N*) AN. Substitution in (112.12) gives 
((AN)*) = —(TN?/V*) (OV /OP)r. (112.13) 
For certain calculations it is convenient to write this formula in a different 
form. Since the derivative (OV/OP), is regarded as taken with N constant, we 


write 
N? (OV oN 


FS). Ge F),, 

The number of particles N, as a function of P, T and V, must be of the form 
N = V f(P, T),as shown by considerations of additivity (cf. § 24); that is, 
N/V is a function of P and T only, and it therefore does not matter whether 
NIV is differentiated at constant N or constant V. Hence we can write 


ne) = 2 (2) 
OP Viey V\OP},y 


2 oP) (5,) - ar) 
(a rv \SH/ py (a Begs 


where we have used the equation N/V = (OP/Ou)7, y, which follows from 
formula (24.14): d2 =—-V dP =—SdT—Ndp. Thus we have for the 
fluctuation of the number of particles the formulat 


(ANP) = T(ON/Op)r,y- (112.14) 


t This formula can also be easily derived directly from the Gibbs distribution. 
According to the definition of the mean value, 


N= eu g NeEN >, ean, 
n 


Differentiation of this expression with respect to] 4 (for constant V and T) gives 


ON 1 02 1 02 
ON etl 24) ouNITS e—EawT = — ((N%)\ 4.9 oe 
sre ¥(¥ rd Sees 7 (N +N =a) 
But 022/04. = —N, and so 
:) a a ke 
oN = ENR) = 2 (ANP, 


which gives (112.14). 
We could also use the Gibbs distribution to derive expressions for the fluctua- 
tions of the other thermodynamic quantities. 
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A body is characterised not only by the thermodynamic quantities consid- 
ered above but also by the momentum P of its macroscopic motion relative 
to the medium. In a state of equilibrium there is no macroscopic motion, i.e. 
P = 0. Motion may, however, result from fluctuation; let us determine the 
probability of such a fluctuation. The minimum work R,,,, in this case is 
simply equal to the kinetic energy of the body: Ry. = P?/2M = 4Mv*, 
where M is its mass and v = P/M the velocity of the macroscopic motion. 
Thus the required probability is 

w oc e~MetAar, (112.15) 


It may be noted that the fluctuations of velocity are statistically independ- 
ent of those of the other thermodynamic quantities. The mean square 
fluctuation of each Cartesian component of the velocity is equal to 


((40,)?) = T/M, (112.16) 


and is inversely proportional to the mass of the body. 

The foregoing formulae show that the mean square fluctuations of such 
quantities as energy, volume, pressure and velocity vanish at absolute zero 
(as the first power of the temperature). This is a general property of all 
thermodynamic quantities which also have a purely mechanical significance, 
but is not in general true of such purely thermodynamic quantities as entropy 
and temperature. 

Formula (112.6) for the fluctuations of temperature can also be interpreted 
from a different point of view. As we know, the concept of temperature 
may be introduced through the Gibbs distribution; it is then regarded as a 
parameter defining this distribution. As applied to an isolated body, the 
Gibbs distribution gives a complete description of the statistical properties, 
except for the inaccuracy that it leads to very small but non-zero fluctuations 
of the total energy of the body; these cannot in fact exist (see the end of 
§ 28). Conversely, if the energy is regarded as a given quantity, we cannot 
assign a definite temperature to the body, and we must suppose that the 
temperature undergoes fluctuations in accordance with (112.6), where C, 
denotes the specific heat of the body as a whole. This quantity (112.6) obvi- 
ously describes the accuracy with which the temperature of an isolated body 
can be defined. 


PROBLEMS 
PROBLEM 1. Find the mean square fluctuation of the energy (using V and T as 
independent variables). 
SOLUTION. We have 


AE= (5) Ay+ (=) 
T 


av ar), 


aP 
AT = I? (=r) 4 AV+C,AT. 
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Squaring and averaging, we obtain 
_ OP 2 (ov i 
((AEY) = In(57) nea (ap) +O : 


PROBLEM. 2. Find ((AW)?) (with variables P and 5S). 
SoLution. (AW)*) = —TV(OP/OV)5+T°C,. 
PROBLEM 3. Find (A7 AP) (with variables V and 7). 
SoruTion. (AT AP) = (T?/C,) (OP/OT)y. 

PROBLEM 4. Find (AV AP) (with variables V and 7). 
SotuTion. (AV AP) = —T. 

PROBLEM 5. Find (AS AV) (with variables V and 7). 
SoruTion. (AS AV) = (OV/OT)pT. 

PROBLEM 6. Find (AS AT’ (with variables V and T). 
Socution. (AS AT) = T. 


PROBLEM 7. Find the mean square fluctuation deviation of a simple pendulum 
suspended vertically. 

SOLUTION. Let / be the length of the pendulum, m its mass, and @ the angle 
of deviation from the vertical. The work R,jjn is here just the mechanical work done 
against gravity in the deviation of the pendulum; for small ¢, Ruin = +mg-[p?. 
Hence 


(p*) = T/me)l. 


PROBLEM 8. Find the mean square fluctuation deviation of the points of a 
stretched string. 


SOLUTION. Let / be the length of the string, and F its tension. Let us take a point 
at a distance x from one end of the string, and let y be its transverse displacement. 
To determine (y*) we must consider the equilibrium form of the string when the 
displacement y of the point x is given; this consists of two straight segments from 
the fixed ends of the string to the point x, y. The work done in such a deformation 
of the string is 


Raia = FLYV2+9)—x]+F Ly (U-2 + 9}-(-9)] = 2*(e+r3) 


x 


Thus the mean square is 


(x) = (T/FI) x(l—x). 


PROBLEM 9. Determine the mean value of the product of the fluctuation dis- 
placements of two different points of the string. 


SoLuTION. Let y;, y2 be the transverse movements of points at distances x,, X» 
from one end of the string (with x, > x,). The equilibrium form for given »,, y. 
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consists of three straight segments, and the work is 


1 ry Xo l-x, 1 
Ruin = — eee Se ee ce D Rees i 
me 2 F(3§ Pra es (—X_) (tg— 1) 2 =) 


from formula (111.8) we then have 
(¥¥e) = (1/FD x, (l—x9). 


§ 113. Fluctuations in an ideal gas 


The mean square fluctuation of the number of particles within some rela- 
tively small volume in an ordinary ideal gas is found by substituting V = 
NT/P in formula (112.13). This gives the simple result 


(ANY?) = N. (113.1) 


The relative fluctuation of the number of particles is therefore just the recip- 
rocal square root of the mean number of particles: 


(any _ 1 
N VN’ 


In order to calculate the fluctuation of the number of particles in an ideal 
Bose or Fermi gas, we must use formula (112.14), with the expression (56.5) 
substituted for N as a function of yu, 7, V, obtained by integrating the corre- 
sponding distribution function. We shall not pause to write out here the fairly 
lengthy expressions which result, but simply note the following point. We 
have seen that, in a Bose gas at temperatures T < Tp (see § 62), the pressure 
is independent of the volume, i.e. the compressibility becomes infinite. 
According to formula (112.13) this would imply that the fluctuations of 
number of particles also become infinite. This means that, in calculating 
fluctuations in a Bose gas, the interaction between its particles cannot be 
neglected at low temperatures, however weak this interaction may be. When 
the interaction, which must exist in any actual gas, is taken into account, 
the resulting fluctuations are finite. 

Next, let us examine fluctuations in the distribution of the gas particles 
over the various quantum states. We again consider the quantum states of 
the particles (including different states of their translational motion); let n, 
be their occupation numbers. 

Let us consider an assembly of 7, particles in the kth quantum state. Since 
this set of particles is statistically independent of the remaining particles in 
the gas (cf.§ 37), we can apply formula (112.14) to it: 

On; 


((An,)?) = TSE. (113.2) 
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For a Fermi gas we must substitute 


Joa: 1 
Tk = eat 1 


The differentiation gives 


((4iny)?) = my —1g). (113.3) 
Similarly, for a Bose gas 
((An,)*) = m1 + nx) (113.4) 
For a Boltzmann gas the substitution n, = e“~ naturally gives 
((An,)*) = nk (113.5) 


which is obtained from both (113.3) and (113.4) when n, « 1. 

Summation of (113.3) or (113.4) over a group of G, neighbouring states 
containing altogether N, = Z», particles gives, by virtue of the statistical 
independence (already mentioned) of the fluctuations of the various m, 


(AN) = Gin FH) = N/(1EN/G), (113.6) 


where 7, is the common value of the neighbouring 7, and N, = n,G,. 

These formulae can be applied, in particular, to black-body radiation (an 
equilibrium Bose gas of photons), for which we must put w = 0 in (113.4). 
Let us consider the set of quantum states for the photon (in a volume V) with 
neighbouring frequencies in a small interval 4w,. The number of such states 
is G, = Vio; Aw,/x*c*, see (63.3). The total energy of the quanta in this fre- 
quency interval is E,,, = Njhw,. Multiplying (113.6) by (4,)? and omitting 
the suffix j, we obtain the following expression for the fluctuation of the 
energy E,,,, of black-body radiation in a given frequency interval Jw: 


(AE 4u)*) = f00 Egy +7704 E4o)"/Vieo* Aco, (113.7) 
a relation first derived by A. Einstein (1909). 


PROBLEM 


Determine ((4N)*) for an electron gas at temperatures much lower than the 
degeneracy temperature. 


SoLuTion. In calculating (ON/Oy)z7, y we can use the expression (57.3) for yu at 
absolute zero. A simple calculation gives 
338mT /N 
(4N ¥) = or ee (5) 
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§ 114. Poisson’s formula 


Knowing the mean square fluctuation of the number of particles in a given 
volume of gas (113.1), we can write down the corresponding Gaussian proba- 
bility distribution for fluctuations in this number of particles: 


oe dN. (114.1) 


w(N) dN = Vln) exp ON 


This formula is, however, valid only for small fluctuations: the deviation 
N—WN must be small compared with the number N itself. 

If the volume selected in the gas is sufficiently small, the number of parti- 
cles in it is small, and we may also consider large fluctuations, where N—N 
becomes comparable with N. This problem is meaningful only for a Boltz- 
mann gas, since in a Fermi or Bose gas the probability of such fluctuations 
can become appreciable only in volumes so small that quantum fluctuations 
become important. 

The solution of this problem is most simply found as follows. Let Vo and 
No be the total volume of the gas and the number of particles in it, and V a 
part of the volume, small compared with Vo. Since the gas is uniform, the 
probability that any given particle is in the volume V is obviously just the 
ratio V/V, and the probability that N given particles are simultaneously 
present in it is (V/Vo)". Similarly, the probability that a particle is not in the 
volume V is (Vo— V)/Vo, and the same probability for No—N given particles 
simultaneously is [(V>—V)/Vo}*°—%. The probability wy that the volume V 
contains N molecules in all is therefore given by 


No! V N V \No-N 
*y = IGN (Va) =) a) 


where a factor has been included which gives the number of possible ways of 
choosing N out of No particles. 

In the case under consideration, V < Vo, and the number N, though it may 
differ considerably from its mean value N, is of course assumed small com- 
pared with the total number No of particles in the gas. Then we may put 
No! = (No—N)! Ne’ and neglect N in the exponent Ny)—N, obtaining 


wr cz (NV VN (y_ VV 
v= at (Fe) ( 0) " 
But NoV/Vo is just the mean number N of particles in the volume V. Hence 
NN 1 N \No 
ww = er ( =i) 
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Finally, using the well-known formula 


we replace (1 —N/No)® with No large by e~ and obtain the required proba- 
bility distribution in the formt 
NNe-N 


Ni (114.3) 


Wy = 


This is called Poisson’s formula. It is easily seen to satisfy the normalisation 
condition 
Pe wn = 1. 
=O 


From this distribution we can calculate the mean square fluctuation of the 
number of particles: 


co = eo NNN 
(N*®) = ees =e YD Wor Di 


- co NN aun oo NN ne N 
=e" Vane tener 
Hence we find as before 


(ANY) = (N®)—N? = N, (114.4) 


and the mean square fluctuation of the number of particles is equal to N for 
any value of N, and not only for large values. 

We may note that formula (114.3) can also be derived directly from the 
Gibbs distribution. According to the latter, the distribution of N gas parti- 
cles, considered simultaneously, among various quantum states is given by 
the expression exp {(2+-uN—e,)/T}, where Xe, is the sum of the energies 
of the individual particles. To derive the required probability wy we must 
sum this expression over all particle states belonging to a given volume V. If 
we sum over the states of each particle independently, the result must be 
divided by N! (cf. § 41), giving 


eat N eat N 
Wy = Wr (z conn) = Wr (5 ne) : 


t For small fluctuations (| N—N | << N with N large), this formula naturally 
becomes (114.1). This is easily seen by using Stirling’s asymptotic expression for the 
factorial of a large number N, N! = «/(22N)-NNe-N, and expanding log w, in 
powers of N—N. 
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The sum is just the mean number N of particles in the volume considered. 
Thus we find 

wy = constantxN/N!, 
and the normalisation condition then shows that the constant is e~¥,t giving 
again formula (114.3). 


§ 115. Fluctuations in solutions 


The fluctuations of thermodynamic quantities in solutions can be calculat- 
ed by the same method as that used in§ 112 for fluctuations in bodies consist- 
ing of identical particles. The calculations are considerably simplified by the 
following argument. 

Let us take some small part of the solution, containing a given number N 
of solvent molecules, and try to calculate the mean fluctuation of the number 
n of solute molecules in that part of the solution or, what is the same thing, 
the fluctuation of the concentration c = n/N in that part. To do so, we must 
consider the most complete equilibrium of the solution that is possible for 
a given non-equilibrium value of n; cf. the first footnote to § 111. Taking a 
given value of the concentration does not affect the establishment of equilib- 
rium between the small part considered and the remainder of the solution as 
regards exchange of energy between them or change in their volumes. The 
former means (see § 9) that the temperature remains constant throughout the 
solution; the latter means that the pressure remains constant throughout the 
solution (§ 12). Thus to calculate the mean square ((Ac)*), it is sufficient to 
consider the fluctuations of concentration occurring at constant temperature 
and pressure. 

This fact in itself signifies that the fluctuations of concentration are statis- 
tically independent of those of temperature and pressure, i.e.t 


(AT Ac) = 0, (AcAP) = 0. (115.1) 


t That is, 2 =—PV=—wNT, in accordance with the equation of state of an 
ideal gas, 

t This may be more rigorously proved by the method indicated in the third 
footnote to § 112. Using the formula dE = TdS—P dV+yp’ dn (with N constant), 
we can rewrite (96.1) as 


dRinin —_ (T—T)) dS —(P—Pp) dV+(u’—pW9) dn. 


Thus, if we take x, = AS, x» = AV,xs = An, thenX, = AT/T, Xo = —AP/T, X3 = 
Ap’/T are thermodynamically conjugate to them. The equations (115.1) then 
follow, since (x,X,) = 0, (x3X_) = 0. 
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The minimum work necessary to change the number n by An at constant 
pressure and temperature is, by (96.1), Ri, = 4@—p’ An, where p’ is the 
chemical potential of the solute. Expanding 4® in powers of An, we have 


~ (8 oD Dpg Ao Ow) Lene 
a0 = (Sr), ant (Sar), 2a = want (Se) CA 


and hence 


Rag ; (Fr) (An). 
P,T 


Substituting this expression in the general formula (112.1) and comparing 
with the Gaussian distribution formula (110.5), we obtain for the required 
mean square of the fluctuation of the number 2 


T 
((4n)*) = Gu amen’ (115.2) 


or, dividing by N?, for the mean square fluctuation of concentration 


T 
Ac») = ——— 115.3 
( c)*) N(e Ww /8 c)p, ( ) 
The latter expression is inversely proportional to the amount of matter (NV) in 
the small part considered, as it should be (see the discussion following 
(112.11)). 
For weak solutions, du’/dn = T/n, and formula (115.2) gives 


(An)*) = n. (115.4) 


It should be noticed that there is a complete analogy (as was to be expected) 
with formula (113.1) for the fluctuations of the number of particles in an 
ideal gas. 


§ 116. Spatial correlation of density fluctuations 


The statement that in a homogeneous isotropic medium (gas or liquid) all 
positions of the particles in space are equally probable applies to any one 
particle on condition that all other particles can occupy arbitrary positions, 
and is not, of course, in contradiction with the fact that the interaction 
between different particles must cause some correlation in their positions. 
If we consider, say, two particles simultaneously, then for a given position 
of one particle the various positions of the other will not be equally probable. 

Let n(r) denote the exact (fluctuating) number density of particles. The 
product n dV is the number of particles present in a volume element dV at 
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a given instant. To represent the correlation between the positions of par- 
ticles at two points in space, we use the spatial correlation function of the 


density fluctuations: 
(Any Anz) = nne—in’, (116.1) 


where An = n—A and the suffixes 1 and 2 denote the values of n(r) at two 
points ri and re. In a homogeneous isotropic medium, the correlation 
function depends only on the magnitude r = |ra—r,| of the distance be- 
tween these points. When r + , the fluctuations at the points r; and re 
become statistically independent, and the correlation function therefore 
tends to zero. 

It is useful to clarify the significance of this correlation function in the 
following way. Since the volume dV is infinitesimal, it cannot contain more 
than one particle at a time; the probability of finding two particles in it at 
once is an infinitesimal quantity of a higher order. The mean number of 
particles 7 dV is therefore also the probability that a particle is present in dV’. 
Let fiwio(r) dV denote the probability that there is a particle in the volume 
element dV when there is one in dV1 (wiz > 1 asr + ©). It is evident from 
the foregoing that the mean value 


(ny dV 1-ne dV 2) = Tn dV 1°1wi2 dV > . 


Hence (nia) = wiei?. In this equation, valid when r1 ¥ ra, we cannot, 
however, go to the limit re + ri, since the derivation ignores the fact that, 
if the points 1 and 2 coincide, a particle in dV; is also in dV. A relation 
which takes account of this is clearly 


(nyn2) = A’wia+nd(re—11). (116.2) 


For let us take a small volume AV, multiply (116.2) by dV1 dV, and integrate 
over AV. The term 7712 then gives a second-order small quantity (propor- 
tional to (AV)*); the term containing the delta function gives a first-order 
quantity, 7 AV, as it should, since (to the first order) the small volume can 
only contain either no particle or one particle. 
The delta-function term is conveniently separated also from the correla- 
tion function (116.1), by writing the latter as 
(An Anz) = no(re—r1) + nv(r), (1 16.3) 
where 
v(r) = n[wie(r)— 1]. (116.4) 
The term “correlation function” will be applied to both the original quantity 
(Am Ang) and the function »(r).* 


t The function »(r) differs from @,,(r) in § 79 as regards the normalisation: 
AW 9 = P, 
24 
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Let us now integrate (116.3) with respect to V; and V2 over a finite volume 
V. With N the total number of particles in this volume (so that AV = N), 
we find 

(ANY) = N +H ff or) dVi dV, 
or changing from the integration over V1 and V2 to one over the coordinates 
of one particle and the relative coordinates r = re—r1, 


[ var og MAN) 2g: (116.5) 
5 N 

Thus the integral of the correlation function over a certain volume is 
expressed in terms of the mean square fluctuation of the total number of 
particles in that volume. Using for the latter the thermodynamic formula 
(112.13), we can express the integral in terms of the thermodynamic quanti- 


ties: 
TN [av 
[vay =—Fe (ap) -1 (116.6) 


In an ordinary (classical) ideal gas this gives f v dV = 0, as it should: in 
such a gas there is no correlation between the positions of different particles, 
since they do not interact with one another, either directly or by exchange 
interaction as in a quantum ideal gas. 

On the other hand, in a liquid (far from the critical point) the first term in 
(116.6) is small compared with unity, because the compressibility of a 
liquid is small. In this case the integral is close to —1. The principal 
interaction forces between the liquid particles have a range of the order 
of the molecular dimensions a. When these forces are taken into account, 
the correlation function »(r) decreases exponentially with increasing distance, 
the exponent being ~ —r/a.t 

Since the fluctuations of density and temperature are statistically independ- 
ent, the temperature may be regarded as constant in considering the fluctua- 
tions of density. The total volume of the body is also constant, by definition. 
Under these conditions the minimum work needed to bring the body out of 
equilibrium is equal to the change AF, in the total free energy of the body, 
so that the probability of the fluctuation may be written as 


wo en4FiT , (116.7) 
The change AF, due to the density fluctuations can be put in the form 
AF, = + Jf (7) Ani Ana dV dV 2. (116.8) 


t The value —1 corresponds, as it were, to the mutual impenetrability of the 
liquid particles, regarded as closely packed solid spheres. 

t There are also, however, weak longer-range (van der Waals) interaction forces, 
which give rise to a term in the correlation function that decreases less rapidly 
<by a power law) with increasing distance (see Part 2). 
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We shall show how the correlation function »(r) may be found from the 
function ¢(r).* 

Let us consider a body of large but finite volume V, and expand 4n as a 
Fourier series: 


An = y Amen, An, = 7 [ares dV; (116.9) 


since An is real, An_, = An,. When these expressions are substituted in 
(116.8) and the integration is carried out, all the terms containing products 
An, An é**®** with k’ + —k give zero, and the result is 


AF, = iV | Any |? b(k), (116.10) 


where the same letter @ with the new argument k denotes the Fourier com- 
ponent of @(r): 
o(k) = | d(e-* dV. (116.11) 


Since each term in the sum (116.10) depends on only one of the 4n,, the 
fluctuations of the different 4n,, are statistically independent. Each square 
{|An,,|? appears in the sum twice (for +k), so that the probability distribution 
for its fluctuations is 


precteen |-7 $(K) [Anu 


Lastly, since |4n,|? is the sum of the squares of two independent quantities 
(An,, being complex), we find from this for the mean square fluctuation 


(An |2) = TIVO(K). (116.12) 


Multiplying both sides of (116.3) by exp (—iker) = exp [—ike(re2—ri)] and 
again integrating with respect to V1 and V2, we have 
({Ang|?) = ALL +o(k)/V,  v(k) = fore dV. (116.13) 


Substitution of (116.12) finally gives the required result 


r 
Hk) = sel. (116.14) 


PROBLEM 
Find the first term in an expansion of the correlation function for a rarefied 
gas in powers of N/V. 
t In mathematical parlance, (7) is the second variational derivative of AF’ 
with respect to n(r). 


24* 


354 Fluctuations 


SOLUTION. We start from formula (79.2). In the first approximation, we can 
assume that all particles except for two specified ones are far apart and their inter- 
action is negligible, so that the integration gives V’—?. To the same accuracy, 
F = Fig. The result is 

wr) = file“ VOT 1}, 


where U(r) is the interaction energy for two gas particles. 
It may be noted that substituting this expression in (79.1) gives as the energy 
of the gas 


N? v 
io Luts v| u(i+5) dV 


= N? U/T 
= Eigt+ Ue~™"" dP. 
‘id 2V | (ellen 


This result is, of course, in agreement with formulae (74.4) and (74,5) for the free 
energy of a slightly non-ideal gas. 


§ 117. Correlation of density fluctuations in a degenerate gas 


As already mentioned in§ 116, in a classical ideal gas there is no correlation 
between the positions of the various particles. In quantum mechanics, 
however, such a correlation exists, because particles in an ideal gas interact 
indirectly, on account of the principle of symmetry of wave functions.t 

The problem of determining the correlation function in a degenerate gas 
can be most simply solved by the second quantisation method, which has 
already been used in § 80 to calculate the energy of an electron gas. In this 
method, the particle number density corresponds to the operator 


Ar) = $+ (0) HH); 
on substitution of the p-operators (80.5), it is expressed by the sum 
A) =D Belper) vy), (117.1) 
»P,p 


oO, 
where the summation is taken over all values of the momenta p and p’ (for 
free particles in the volume V) and over the spin components a, o’.t However, 
since the spin wave functions for different values of o are orthogonal, only 


t The correlation of fluctuations in a Fermi gas was considered by V. S. Fursov 
(1937), and in a Bose gas by A. D. Galanin (1940). 

t It may be recalled that the wave functions of a particle having spin are spinors, 
and that the product of wave functions in (117.1) is in fact the “scalar product” of 
covariant and contravariant spinors with the appropriate summation over spinor 
indices; the latter should not be confused with the suffixes o and o’ which denote 
the eigenvalues of the spin component in the states concerned. 
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the terms in the sum that have o = o’ are in fact non-zero. In the products 
Yrop’g» the normalised spin factors give unity, so that the wave functions 
can be written simply as coordinate plane waves: 


1 


Pp = /V efperih ° dl 17.2) 


It is easily seen that the diagonal terms (p = p’) of the sum (117.1) give 
just the mean density #: since the operator 474,, is simply the number 7,, 
of particles in the corresponding quantum state, the sum of these terms is 

1 N = 
ve Mpa = V =. 
We can therefore write 
Ah = A(r)—Ai 
=) apebyetpe py» (117.3) 
9, D, p’ 
where the prime to the summation sign denotes that the diagonal terms are 
to be omitted. Using this expression, we can easily calculate the required 
mean value (47; Ane). 

This mean value is calculated in two stages. First of all, the quantum aver- 
raging over the states of the particles is to be carried out. This amounts to 
taking the corresponding diagonal matrix element of the quantity concerned. 
Multiplying together the two operators (117.3), which belong to two differ- 
ént points r; and rg, we obtain a sum of terms containing various products of 
the operators @,, and a} taken four at a time. But among these products 
only those which contain two pairs of operators d,, and 4%, with the same 
suffixes have non-zero diagonal matrix elements, i.e. the relevant terms are 

Y Gpadyolipo PS(t1) Wo (ts) y(t2) volta): 


o,D,D 
These terms form diagonal matrices with 
Orrolig'o 7 LF yo; Opole = Myo 5 


here and henceforward, the upper sign refers to the case of Fermi statistics 
and the lower sign to that of Bose statistics. Substituting also the functions 
tp, (117.2), we obtain 


— YY (LF tye) tpeelo—P)-a—eDIh , 


o,D,P 
This expression must now be averaged in the statistical sense, i.e. over the 
equilibrium distribution of particles among the various quantum states. 


Since particles in different quantum states behave independently of one 
another, the numbers n,, and ny, are averaged independently. The required 
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mean value is thus found to be 
(Ani Ane) = a a (1 FE Nye) Mpae!P P+ a1) (117.4) 
oO, Dp, 


We now change in the usual way from summation over p and p’ to inte- 
gration over V d3pV d'p’/(27h)*; the restriction p ~ p’ then becomes unim- 
portant. The integral separates into two parts, of which the first is 


= Pe as 
” | Npgel®—P): Car) Onky 


The integration over d°p’/(27h) gives the delta function 6(r2—r1), which 
enables us to put re—ri = 0 in the ae integrand; this leaves 


6(r2— DE [Me oa Onn = = 70(re—"). 


This is just the first term in (116.3). The correlation function (the second 
term in (116.3)) is therefore 


wr) = F= aol fem An, CaF | Pi (117.5) 


In a gas in equilibrium, the distribution of particles among quantum states 
is given by the Fermi or Bose distribution formula: 


Np = My = [e€-Y/T+ 1-1. (117.6) 
These numbers are independent of o; the summation over a in (117.5) there- 


fore gives simply a factor g = 2s+1 (where s is the spin of the particle). 
Thus we have finally the following formula for the correlation function:* 


+ | elper/h d3p ie W 
WS #4 aoorar aT ET OnhyP wae) 
or, after integration over the directions of p, 
re 
= g sin (pr/fi)p dp | 
v(r) = + rahe “eenreel™ | (117.8) 


We may also state a formula for the mean squares of the Fourier compo- 
nents of the density fluctuations, which is easily obtained by substituting 
“(r) from (117.7) in the general formula (116.13) and integrating over the 
coordinates :t 


(\Amgl®) = & | ral Farm) pee (117.9) 


t For a Bose gas, this formula applies only to temperatures above the Bose— 
Einstein condensation point (see Problem 4). 

t The Fourier components 4n, of the gas density fluctuations should not be 
confused with the particle quantum state occupation numbers 4. 
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Formula (117.7) shows first of all that for a Fermi gas »(r) < 0, but for a 
Bose gas »(r) > 0. In other words, in a Bose gas the presence of a particle 
at Some point in space increases the probability that another particle is near 
that point, i.e. the particles “attract” one another; in a Fermi gas, on the 
contrary, they “repel” one another (cf. the remark at the end of § 56). 

In accordance with the discussion at the beginning of this section, if we 
go to the limit of classical mechanics, the correlation function tends to zero: 
asf—-0, the frequency of the oscillating factor exp (iper/f) in the integrand 
in (117.7) increases without limit, and the integral tends to zero. 

As r—0, the function »(r) tends to a constant limit: 


(0) = F£.! ales pal ae | =F2. (117.10) 


Let us apply the formula (117.8) to a Fermi gas at T = 0. In this case the 
distribution function is a step function: n, = 1 for p < pg and n, = 0 for 
P > Pp, where pz = h(6n°7i/g)3 is the limiting momentum. We therefore 
find 

| Per 2 
V(r) = — at | PN 


,o 


Let us take fairly large distances, assuming that p,pr/h >> 1. Accordingly, we 
retain only the term in the lowest power of 1/r in the integral: 


3h prr 
= —-— Joel tae 
v(r) intpsrl cos? 
The squared cosine varies rapidly over distances Ar that are small in compar- 
ison with the distances considered. Averaging over such a distance, we find 


v(r) = —3h/42* pert. (117.11) 


PROBLEMS 


PROBLEM 1. Determine the mean square of the Fourier component (with small 
wave numbers: k < pr/f) of the density fluctuations in a Fermi gas at T = 0. 


SOLUTION. The integrand in (117.9) is non-zero (and equal to unity) only at 
points where mp = 1, tps ax = 0, i.e. at points in a sphere of radius pr and also not 
in a similar sphere whose centre is shifted by #k. A calculation of the volume of this 
region when vik <« pr gives 


(| Any |?) 


segkpp/(200)*h?V 
3khn/4prV. 


PROBLEM 2. Determine the correlation function for a Fermi gas at temperatures 
small compared with the degeneracy temperature. 
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SOLUTION. In the integral in (117.8) we put = &-= p}/2m and transform 
it as follows: 


pe i psin(pr/fijdp __, @ ik (pr/f) dp 


e@—er)/T 4] aia “eem~erT+ 1” 


Integration by parts is followed by a change to the variable x = pp(p—pp)/mT. 
Since T is small, the integrand decreases rapidly with increasing |x|, and the 
integral over x can therefore be extended from — o to oo: 


o 1 C sin (2F" dx 
saa in (F ca 4 (+1) (e-*+1) 
oO sin ppr/it elArx _ a : 
or r (e* +1) (e-* +1) 
where 1 = mT/fipy. The resulting integral is transformed by the substitution 
1/(e*+1) = u into Euler’s beta integral, giving 


ae wh |soper 
vii Br (saad = a 


For distances r >> fi/pp, averaging the rapidly varying squared cosine, we have 
finally 


3(mT}* gemTr \? 
ek sinh~2 ‘ 
(eat \dfippr™ ( fipp 


As T — 0, this becomes (117.11). In the asymptotic region, where prr/f is large 
compared with ¢,/T as well as with unity, we find 


2. 
oS (2) 


PROBLEM 3. Determine the correlation function for a Bose gas at large distances 
(r>> fil\/(mT)) for temperatures above the point J) at which Bose-Einstein 
condensation begins, but close to this point. 


SOLUTION. Near the point 7, the chemical potential || is small (see § 62, 
Problem). The integral in (117.7), which we denote by /, is then determined by 
small values of p: €/T ~ p?/mT ~ |u\/T <« 1. Hence, expanding the integrand in 
powers of ¢ and p, we findt 

eprein d’p (2m | |? 
I=T exp 3 —r——__——__ }. 
Plam-+ iu) Oa = ar fi 


t A Fourier transformation formula is used: 


Yaa 4 ek-r = g3x en*r 
___. pwiker ae ge . 
! al ay w2-4-42 ” |sz2 


‘This is most simply derived by noticing that the function .: = e~*"/r satisfies 
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Finally 


272 2(2m 
Wr) = 7 aaprh oP {- ann. 


PROBLEM 4. Determine the correlation function for a Bose gas with T< Tp. 


SOLUTION. For T< 7p, a finite fraction of the number of particles (N,_) is in 
states with p = 0 (the condensate). Returning to the expression (117.4), we must 
first (before changing from summation to integration) separate the terms with p 
or p’ = 0, noting that the number of particles in each quantum state with p = 0 
iS My 9 = N,u0/¥. The sum is then transformed as in the text, and instead of (117.7) 
we find 


= 20 & — [ ip-rimq-_ OP 
a a i= [e "> mt)’ 


where ny = N,W9/g; " is given by the Bose distribution formula with u = 0: 
n = [e/7—1]-1, 


At distances r >> fi/4/(mT), the integral 1 = m7/2xfr (from Problem 3 with 
p4 = 0), so that 
__ mTng gmt? | 
ae unr * Ar ABe * 


the second term may be neglected if T is not too close to Ty (and so ny is not too 
small). In the opposite case, at distances r « h/4/(mT), the integral 


— dp A—No 
~ | "anne ~~ ¢ 


I 


So that 


wr) = (0) = (#—n})/gA. 


Note that the integral f v dV for a Bose gas diverges if T < 7p, and the calcula- 


tion from (116.5) would therefore lead to an infinite value of the fluctuation of the 
number of particles, in accordance with the comment in § 113. 


§ 118. Correlations of fluctuations in time 


Let us consider a physical quantity which describes a closed system in ther- 
modynamic equilibrium, or part of such a system. (In the former case, the 
quantity must not be one that by definition remains constant for a closed 


the differential equation 
Agd—wo = —42 50). 


Multiplying this by e~'*-t and integrating over all space (with twofold integration 
by parts in the first term), we obtain the required result. 
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system suchas the energy.) This quantity will undergo small variations in time, 
fluctuating about its mean value. Let x(t) again denote the difference between 
the quantity and its mean value (so that = 0). 

There is some correlation between the values of x(t) at different instants, 
this means that the value of x at a given instant ¢ affects the probabilities 
of its. various values at a later instant ¢’. In the same way as for the spatial 
correlation discussed in the preceding sections, we can characterise the time 
correlation by the mean value of the product (x(1) x(?’)). The averaging here 
is, as usual, understood in a statistical sense, i.e. as an averaging over the 
probabilities of all possible values of the quantity x at the times ¢ and /’. 
As has been mentioned in § 1, this statistical averaging is equivalent to a time 
averaging—in this case over one of the times ¢ and ¢’ for a given difference 
# —1’. The quantity thus obtained, 


f(t’ —t) = (x(t) x(’)), (118.1) 


depends only on the difference ¢’ — t, and the definition may therefore also be 
written 

(t) = (x(0) x(t)). (118.2) 
As the time difference increases without limit, the correlation clearly tends 
to zero, and accordingly the function $(*) likewise tends to zero. Note also 
that, because of the obvious symmetry of the definition (118.1) as regards 
the interchange of t and ?’, the function (/) is even: 


p(t) = o(—2). (118.3) 


By regarding x(#) as a function of time, we have implicitly assumed that it 
behaves classically. The definition given above can, however, be put in a 
form that is applicable to quantum variables also. To do this, we must 
consider in place of the quantity x its quantum-mechanical time-dependent 
(Heisenberg) operator X(t). The operators £(¢) and X(¢) relating to different 
instants do not in general commute, and the correlation function must now 
be defined as 

H(t — 1) = RAY) R(C') + 8) (0), (118.4) 


where the averaging is with respect to the exact quantum state.‘ 
Let us assume that the quantity x is such that by specifying a definite 
value of it (considerably greater than the mean fluctuation (x?)"”) one could 


t It may be again mentioned that, according to the basic principles of statistical 
physics, the result of the averaging does not depend on whether it is done mechani- 
cally with respect to the exact wave function of a stationary state of the system, 
or statistically by means of the Gibbs distribution. The only difference is that in 
the former case the result is expressed in terms of the energy of the body, and in 
the latter case as a function of its temperature. 
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characterise a definite state of partial equilibrium. That is, the relaxation 
time for the establishment of partial equilibrium for a given value of x is 
assumed to be much less than that required to reach the equilibrium value of 
x itself. This condition is satisfied for a wide class of quantities of physical 
interest. The fluctuations of such quantities will be said to be guasi-station- 
ary.’ In the rest of this section we shall consider fluctuations of this type and, 
moreover, shall assume the quantity x to be classical.t 

We shall also assume that, as complete equilibrium is approached, no 
other deviations from equilibrium arise in the system which would require 
the use of further quantities to describe them. In other words, at every instant 
the state of the non-equilibrium system must be entirely defined by the value 
of x; a more general case will be discussed in § 119. 

Let the quantity x have at some instant t a value which is large compared 
with the mean fluctuation, i.e. the system be far from equilibrium. Then we 
can say that at subsequent instants the system will tend to reach the equi- 
librium state, and accordingly x will decrease. Under the assumptions made, 
its rate of change will be at every instant entirely defined by the value of x at 
that instant: x = x(x). If x is still comparatively small, then x(x) can be 
expanded in powers of x, keeping only the linear term: 


dx/dt = — Ax, (118.5) 


where A is a positive constant; there is no zero-order term in this expansion, 
since the rate of change dx/dt must be zero in complete equilibrium (i.e. at 
x = 0). Equation (118.5) is the linearised macroscopic “equation of motion” 
of the non-equilibrium system, describing its relaxation process (the physical 
nature of which depends entirely on the nature of the quantity x). The 
constant 1// determines the order of magnitude of the relaxation time for the 
establishment of complete equilibrium. 

Returning to fluctuations in an equilibrium system, let us define a quan- 
tity €,(¢) as the mean value of x at an instant ¢ > 0 with the condition that it 
had some given value x at the prior instant ¢ = 0; such a mean value is in 
general not zero. Evidently the correlation function ¢(#) may be written in 
terms of &,(1) as 


b(t) = (xé(D), (118.6) 


where the averaging is only over the probabilities of the various values of x at 
the initial instant t = 0. 


* This term is more appropriate than thermodynamic, as used in the previous 
edition of the book. 

t The final formulae for the quasi-stationary fluctuations of quantum variables 
are obtained from that for classical variables by a simple modification described 
in § 124 (see (124.19)). 
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For values of &, large compared with the mean fluctuation, it follows from 
(118.5) that also 


dé())/dt =—JE(), 1t>0. (118.7) 


Taking into account the fact that ¢,(t) is an averaged quantity, we must 
expect that this equation is true even when &, is not large. Integrating (118.7) 
we find, since £,(0) = x by definition, 


E(t) = xe~*, 


and finally, substituting in (118.6), we obtain a formula for the time corre- 
lation function: 


P(t) = e"™, 


This formula as it stands is, however, valid only for ¢ > 0, since in its 
derivation (equation (118.7)) we have assumed that the instant ¢ is later 
than ¢ = 0. On the other hand, since $(¢) is an even function, we can write 
the final formula as 


P(A) = (xP)e“4IFl = (L/Be*l4, (118.8) 


with (x?) from (110.5), and it is then applicable for both positive and nega- 
tive t. This function has two different derivatives at ¢ = 0, the reason being 
that we are considering time intervals long compared with the time for the 
establishment of partial equilibrium (equilibrium with a given value of x). 
The consideration of short times, which is not possible within the “quasi- 
stationary” theory, would of course give d¢@/dt = 0 for t = 0, as must be 
true for any even function of ¢ that has a continuous derivative. 

The above theory can also be formulated in another way that may have 
certain advantages. 

The equation X =—Ax for the quantity x itself (rather than its mean 
value &,) is valid, as already mentioned, only when x is large compared with 
the mean fluctuation of x. For arbitrary values of x, we write 


K=—Axt+y, (118.9) 


thus defining a new quantity y. Although the magnitude of the oscillations 
of y does not change with time, when x is large (in the sense already defined) 
y is relatively small and may be neglected in equation (118.9). It is necessary 
to regard the quantity y thus defined (called the random force) as the source 
of the fluctuations of x. The correlation function of the random force, 
<y(0) y(t)), must be specified in such a way as to lead to the correct result 
(118.8) for (x(0) x(7)). To do so, we must put 


(y(0) (0) = 22x%) (4) = (24/8) 8(). (118.10) 
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This is easily seen by writing the solution of equation (118.9): 


x(t) = e-# f y(r)e** dr, 


and averaging the product x(0) x(#) after expressing it as a double integral. 

The vanishing of the expression (118.10) when ¢ + 0 signifies that the 
values of y() at different instants are uncorrelated. In reality, of course, 
this statement is an approximation and signifies only that the values of y(#) 
are correlated over time intervals of the order of the time for partial equi- 
librium to be established (equilibrium with a given value of x), which, as 
already mentioned, is regarded as negligibly small in the theory given here. 


§ 119. Time correlations of the fluctuations of more than one variable 


The results obtained in § 118 can be generalised to fluctuations in which 
several quantities x,, X2, ..., X, simultaneously deviate from their equilib- 
rium values. We shall again suppose that the latter have been subtracted, 
so that each mean value x, = 0. 

The correlation functions for the fluctuations of these quantities are de- 
fined (in the classical theory) as 


bul —1) = (xd) xed). (119.1) 
By virtue of this definition, they have the obvious symmetry property 
dit) = Ox{—D). (119.2) 


There is, however, another symmetry property of the correlation func- 
tions, which has a profound physical significance. It arises from the sym- 
metry under time reversal of the equations of mechanics which describe the 
motion of the particles in the body.’ Because of this symmetry, it does not 
matter which of the quantities x, and x, is taken at the earlier or later in- 
stant in the averaging. Hence (x,(1’) x,(4)) = (x(t) x; (0')), ie. 


bit) = bu(—d. (119.3) 


From (119.2) and (119.3) it also follows that $,() = @,,(#). 

In this derivation we have tacitly assumed that the quantities x, themselves 
are unaffected by time reversal. There are, however, quantities which change 
sign under time reversal, for instance quantities proportional to the velocities 
of some macroscopic motions. lf both quantities x, and x, have this property, 


tThe system is assumed to be not in a magnetic field and not rotating as a whlole 
(see § 120). 
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the relation (119.3) is again valid, but if only one of the two changes sign the 
symmetry under time reversal implies that (x?) x,(1)) = —(x,(t) x,(t)), 1. e- 


biult) = — bu —2). (119.4) 


Together with (119.2), this gives (4) = — @,,(¢). 

We shall now assume, as in § 118, that the fluctuations are quasi-stationary, 
i. e. that the set of values of x,..., x, (outside the limits of their mean 
fluctuations) determines a macroscopic state of partial equilibrium. As 
complete equilibrium is approached, the x, vary with time; we assume that 
the set of functions x,() completely describes this process, and that no other 
deviations from equilibrium occur in it. Then the rates of change of the x, 
in any non-equilibrium state are functions of the values x, ..., x, in that 


state: 
Xp KKy. © 625 Xp)s (119.5) 


If the system is in a state fairly close to complete equilibrium, i.e. if the x, 
may be regarded as small, the functions (119.5) may be expanded in powers 
of the x, as far as the first order, i.e. as the linear sums 


Xi = — Ain Xk (119.6) 


with constant coefficients 4,,; these expressions are the generalisation of 
(118.5).? 

From this we can go to the equations for the correlation functions in the 
same way as in § 118. We define the mean values &,(4) of the quantities x, at a 
time ¢ > 0 for given values of all the x1, x2,... at the earlier time ¢ = 0; 
these values themselves are, for brevity, omitted from the notation &,(t). 
These quantities satisfy the same equations (119.6): 


E; = — duke, (119.7) 


and do so for arbitrarily small values of £,(f) as well as for those which 
are large compared with the mean fluctuations. The correlation functions 
are obtained from the &,(f) by multiplying by x, = x0) and averaging with 
respect to the probabilities of the various values of x,: dg{t) = E(x). 
This operation performed on equation (119.7) gives 


d 
ar bit) = —Andult), (t> 0) (119.8) 
the suffix / in these equations being a free suffix. 
As already mentioned, equations (119.6) are just the linearised macro- 


scopic “equations of motion” of the non-equilibrium system, describing its 


t As in§ 111, summation from 1 to” over repeated Latin suffixes is implied. 
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relaxation process. We see that the set of equations for the correlation 
functions of the fluctuations is obtained by simply replacing the quantities 
x,(t) in these “equations of motion” by functions ¢,(t) with a free suffix / 
that takes all values from 1 to n. The equations thus obtained refer to times 
t > 0, and are to be integrated with the “initial condition” 


bix(0) = (x0) xx(0)) = (xiX%%) = Boies (119.9) 


the mean values {x;,x,) must be equal to their known values (111.9). For 
times ¢ < 0, the correlation functions are then found directly from their 


symmetry properties. 


§ 120. The symmetry of the kinetic coefficients 


Let us return to the macroscopic equations (119.6), which describe the rela- 
xation of a slightly non-equilibrium system:* 


Xi = —AikXk. (120.1) 


These equations have a deep-lying internal symmetry, which becomes 
explicit only when the right-hand sides are expressed not in terms of the 
macroscopic quantities x, themselves (whose rates of change are on the 
left-hand sides) but in terms of the thermodynamically conjugate quantities 


X; = —0S/0x;, (120.2) 


which have already been used in § 111. In a state of equilibrium, the entropy 
of the system is a maximum, so that all the X, = 0. If x,, x, ... are not 
zero but comparatively small (i.e. in slightly non-equilibrium states of the 
system), the XY, may be expressed as linear functions: 


Xi = BikXk- (120.3) 


The constant coefficients 6, are the first derivatives of the X,, i.e. the second 
derivatives of S, and therefore 


Bie Bex, (120.4) 


t In particular applications, cases occur where the complete equilibrium which 
is being approached depends on external parameters (such as volume or external 
field) which themselves vary slowly with time; the equilibrium (mean) values of 
the quantities considered therefore vary also. If this variation is sufficiently slow, 
we can again use the relations derived here, except that the mean values x, can- 
not be regarded as always equal to zero. If they are denoted by xf, then (120.1) 
must be replaced by 


Ry = —Ain(x~— 2). (120.1a) 
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If we express the quantities x, in terms of the Y, from (120.3) and substi- 
tute in (120.1), we obtain the relaxation equations in the form 


Xi = —yieXr, (120.5) 


‘where 
Vie = Aup'e (120.6) 


are new constants called kinetic coefficients. We shall now prove the principle 
of the symmetry of the kinetic coefficients or Onsager’s principle (L. Onsager, 
1931), according to which 


Vik = Vki- (120.7) 


The proof is based on the fact mentioned in § 119 that similar equations 
(120.1) and (120.5) are satisfied by the quantities that describe fluctuations 
in an equilibrium system. We define mean values &(#) of the fluctuating 
quantities x,, and mean values &,(#) of the X,, at a time t, for given values 
of x1, Xe, ... at t= 0; then 


Es=—yinE,  (t > 0) (120.8) 


We now make use of the symmetry of the fluctuations (in an equilibrium 
system) under time reversal, expressed by the relation (119.3), which may be 
written 

(xi(t) x(0)) = (x,(0) x{)), (120.9) 
or, with the £,(2), 


EXO) xk) = (xi), (120.10) 


where the averaging is with respect to the probabilities of the various values 
of all the x, at t = 0. We differentiate this equation with respect to ¢ and 
substitute the derivatives é, from (120.8): 


ViKE(Oxk) = YeKxiZ (0). 


When t = 0, the £, are evidently equal to X,(0); hence, putting ¢ = 0 in the 
above equation, we get 
ViKX Xk) = Vu XiXi), 


where the two factors in the averaged products are taken at the same instant. 
From (111.8), such mean values (X,x,) = 64, and we arrive at the required 
result (120.7)." 


t Here it would be incorrect to use instead of (120.9) the relation (119.2), which 
gives (x0) x,(t)) = (x(—2) x,(0)). It can be shown that differentiating this equa- 
tion with respect to ¢ and then putting ¢ = 0 gives (with the use of (120.9) ()x,) = 0. 
In reality, however, the functions d,(2) (like f(t) in § 118) have in this approx- 
imation two different derivatives at t= 0, according as t--+0 ort -»—0. 
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The following two comments should, however, be made regarding this 
result. The proof depends on the symmetry of the equations of motion with 
respect to time, and the formulation of this symmetry is somewhat altered 
for fluctuations in a uniformly rotating body and for bodies in an external 
magnetic field: in these cases the symmetry under time reversal holds only 
if the sign of the angular velocity of rotation 2 or of the magnetic field H 
is simultaneously changed. Thus in these cases the kinetic coefficients dep- 
end on 9 or H as parameter, and we have the relations 


Vik 2) = Ye(—2),  vieCH) = yxi(—H). (120.11) 


Moreover, it has been assumed in the derivation that the quantities x, 
and x, are unaffected by time reversal. The relation (120.9), and therefore 
the result (120.7), remain valid if both quantities change sign under time 
reversal (are both proportional to the velocities of some macroscopic mo- 
tions). But if one of x, and x, changes sign and the other remains unchanged, 
the derivation must start from (119.4) instead of (119.3), and the principle 
of the symmetry of the kinetic coefficients is formulated as 


Vik = —YVki- (120.12) 


Exactly similar results are valid for the kinetic coefficients ¢,, which appear 
in the relaxation equations when these are put in the form thermodynami- 
cally conjugate to equations (120.5): 


Xi=—Cuxe Cie = Bian. (120.13) 


The coefficients ¢, have symmetry properties similar to those of the ,,. 
This can be seen by a corresponding derivation, but is in any case evident 
because of the reciprocal relation between the quantities x, and X,; see the 
second footnote to § 111. 

If all the quantities x,, ..., x,, behave in the same way under time reversal, 
so that the matrix of the y,, is entirely symmetrical, the rates x, can be repre- 
sented as derivatives 


Xi = —of/oXx,, f= FyiXiXk; (120.14) 


of a quadratic function of the X, with coefficients y,. 
The importance of the function f is due to the fact that it determines the 
rate of change of the entropy of the system: 


S = (08/0x)x%; = —Xi% = X;, Of/0X,, 
and, since f is a quadratic function of the X,, Euler’s theorem gives 


$=2f (120.15) 
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As equilibrium is approached, the entropy increases to a maximum. The 
quadratic form f must therefore be positive-definite, and this imposes certain 
conditions on the coefficients y,,. The function f may also be expressed in 
terms of the x,, and its derivatives then give the rates X;: 


Xx; = —Of/oxi, f= AC AXiXk- (120.16) 


Here, of course, we have as before S = —x,X, = 2f. 

For a system consisting of a body in an external medium we can trans- 
form (120.15) by using the fact that the change in entropy of a closed system 
in a deviation from equilibrium is — R,,,,/T>, where R,,;, is the minimum work 
needed to bring the system from the equilibrium state into the one considered 
(see (20.8)).' Putting also R,;, = 4E—T, AS +P, AV (where E, S and V 
relate to the body, and To, Po are the temperature and pressure of the me- 


dium), we obtain 
E-—T)S+PV =—2fTo. (120.17) 


In particular, if the deviation from equilibrium occurs with the temperature 
and pressure of the body constant and equal to T and Po, 


é = —2fT, (120.18) 
and at constant temperature and volume 
F = —2/T. (120.19) 


§ 121. The dissipative function 


The macroscopic motion of bodies surrounded by an external medium is in 
general accompanied by irreversible frictional processes, which ultimately 
bring the motion to a stop. The kinetic energy of the bodies is thereby con- 
verted into heat and is said to be dissipated. 

A purely mechanical treatment of such a motion is clearly impossible: 
since the energy of macroscopic motion is converted into the energy of ther- 
mal motion of the molecules of the body and the medium, such a treatment 
would require the setting up of the equations of motion for all these mole- 
cules. The problem of setting up equations of motion in the medium which 
contain only the “macroscopic” coordinates of the bodies is therefore a 
problem of statistical physics. 


t Owing to this relation between the change of the, entropy and Ryia, the X; 


can also be defined as 
X, = (1/T >) ORmin/Ox;, (120.2a) 


which is sometines more convenient taan (120.2); cf. (22.7). 
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This problem, however, cannot be solved in a general form. Since the 
internal motion of the atoms in a body depends not only on the motion of 
the body at a given instant but also on the previous history of the motion, the 
equations of motion will in general contain not only the macroscopic co- 
ordinates Q,, Q,, ...,Q, of the bodies and their first and second time deriva- 
tives but also all the higher-order derivatives (more precisely, the functions 
Q,(t) are subject to some integral operator). The Lagrangian for the macro- 
scopic motion of the system does not then exist, of course, and the equations 
of motion will be entirely different in different cases. 

The equations of motion can be derived in a general form if it may be 
assumed that the state of the system at a given instant is completely deter- 
mined by the values of the coordinates Q, and velocities Q,, and that the 
higher-order derivatives may be neglected; a more precise criterion of small- 
ness has to be established in each particular case. We shall further suppose 
that the velocities Q, are themselves so small that their higher powers may be 
neglected, and finally that the motion consists of small oscillations about 
certain equilibrium positions. The latter is the case usually met with in this 
connection. We shall assume the coordinates Q, to be chosen so that Q, = 0 
in the equilibrium position. Then the kinetic energy K(Q,) of the system will 
be a quadratic function of the velocities Q, and independent of the coordi- 
nates Q, themselves; the potential energy U(Q,) due to the external forces 
will be a quadratic function of the coordinates Q,. 

We define the generalised momenta P, in the usual way: 


P; = 8K(O)/00}. (121.1) 


These equations define the momenta as linear combinations of the velocities ; 
using them to express the velocities in terms of the momenta and substituting 
in the kinetic energy, we obtain the latter as a quadratic function of the 
momenta, with 


Q; = OK(P)/OP,. (121.2) 


If the dissipative processes are entirely neglected, the equations of motion 
will be as in ordinary mechanics, according to which the time derivatives 
of the momenta are equal to the corresponding generalised forces: 


P, = —0U/80,. (121.3) 


First of all, let us note that equations (121.2), (121.3) are in formal agree- 
ment with the principle of the symmetry of the kinetic coefficients, if the 
quantities x,, X2) ..., Xe, defined in § 120 are taken as the coordinates Q, 
and momenta P,. For the minimum work needed to bring the bodies from 
a state of rest in their equilibrium positions to the positions Q, with mo- 
menta P, is R,,,, = K(P,))+U(Q,). The quantities X,, X4,..., X2, will 
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therefore be the derivatives (see the previous footnote) 


Xo, = (1/T) ORmin/OQ; = (1/T) 8U/0Q,, 
Xp, = (1/T) ORmin/OP: = (1/T) OK/OP; 


and equations (121.2), (121.3) will correspond to the relations (120.5) with 
Yop, = —T =—Yp,g» in accordance with the rule (120.12); this is a case 
where one quantity (Q,) remains invariant under time reversal but the other 
(P,) changes sign. 

In accordance with the general relations (120.5), we can now write the 
equations of motion allowing for dissipative processes, by adding to the 
right-hand sides of equations (121.2), (121.3) certain linear combinations of 
the quantities Xp, Xp, such that the required symmetry of the kinetic 
coefficients is maintained. It is easy to see, however, that equations (121.2) 
must be left unchanged, since they are simply a consequence of the definition 
(121.1) of the momenta and do not depend on the presence or absence of 
dissipative processes. This shows that the terms added to equations (121.3) 
can only be linear combinations of the quantities Xp, (i.e. of the derivatives 
OK/OP,), since otherwise the symmetry of the kinetic coefficients would be 
violated. 

Thus we have a set of equations of the form 


P,=— 
: 50; 27 Mk OP, 
where the constant coefficients y, are related by 
Vik = Ykt- (121.4) 
Putting 0K/OP, = O,, we have a 
P,= —30, 7 poy ViKQk » (121.5) 


These are the required equations of motion. We see that the presence of 
dissipative processes leads, in this approximation, to the appearance of 
frictional forces which are linear functions of the velocities. Owing to the 
relation (121.4) these forces can be written as the derivatives, with respect 
to the corresponding velocities, of the quadratic function 


f= 3D Yn Oie, (121.6) 
which is called the dissipative function. Then 
Pie (121.7) 


00; 00; 
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Using the Lagrangian L = K—U, we can write these equations of motion as 


4jab) &_ of 
a (80,) 30 =~" (121.8) 


which differs from the usual form of Lagrange’s equations by the presence 
of the derivative of the dissipative function on the right. 

The existence of friction causes a decrease in the total mechanical energy 
K+U of the moving bodies. In accordance with the general results of § 120 
the rate of decrease is determined by the dissipative function. This will be 
proved afresh here, since there is some difference in the notation used in the 
present section. We have 


d 5 /0K . ou oU 
at Ue 24 (ap? +39, 21) ~ .0(P+397) 


or, substituting (121.7) and using the fact that the dissipative function is 
quadratic, 


d = Of _ 
a (K+U) =—2, 036° =~ 4 (121.9) 


as it should be. 

Finally, we may mention that, when there is an external magnetic field, 
the equations of motion again take the form (121.5), but (121.4) is replaced 
by 

vicCH) = ye(—H). 


As a result, there is no dissipative function whose derivatives determine thn 
frictional forces. The equations of motion therefore cannot be written ie, 
the form (121.7). 


§ 122. Spectral resolution of fluctuations 


The spectral resolution of a fluctuating quantity x(t) is defined by the usual 
Fourier expansion formulae: 


X» = J x(ne dr, (122.1) 


and conversely 
r daw 
= —iot 
x(t) { Xwe bt mn’ (122.2) 


It should be noted that the integral (122.1) in fact diverges, since x(t) does 
not tend to zero as |t] +. This is unimportant, however, as regards the 
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formal derivations below, whose purpose is to calculate the mean squares, 
which are certainly finite. 

Substituting (122.2) in the definition (118.1) of the correlation function, 
we obtain 


bf —1) = GL) x) 


a [ | (XpXape Hots oft) 5 : (122.3) 


ev 
— oo —oo 


If this integral is to be a function of the difference t—?¢ only, the integrand 
must contain a delta function of w +m’, i.e. we must have 


(XaXe") = 27(x*), 6(@ +’). (122.4) 


This relation is to be regarded as the definition of the quantity symbolically 
denoted by (x*),,. Although the quantities x,, are complex, (x*),, is clearly 
real. The expression (122.4) is zero except when w’ = —q, and is symmetrical 
with respect to interchange of w and w’; hence (x*),, = (x*)_,,; and a change 
in the sign of w is equivalent to taking the complex conjugate. 

Substituting (122.4) in (122.3) and eliminating the delta function by inte- 
grating with respect to w, we find 


6) = [ eer (122.5) 


id 
—oo 


In particular, (0) is the mean square of the fluctuating quantity: 


oo oo 


y= [ao = (29,2. (122.6) 
a J 
—oo 0 


We see that the spectral density of the mean square fluctuation is just the 
quantity (x*),,, or 2(x*),, if the integral is taken only over positive frequen- 
cies. The same quantity is also, by (122.5), the Fourier component of the 
correlation function. Conversely 


(x2), = f b(Helm dt. (122.7) 


In the formulae given above, x(t) has been regarded as a classical quantity. 
For a quantum variable, the expansions (122.1) and (122.2) must relate to 


t We follow S. M. Rytov in the treatment of the spectral resolution of fluctua- 
tions. 
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a time dependent operator £(#), and the spectral density (x?),, is defined by 
Rae! + Xer Xa) = 2x?) 6(o +0’) (122.8) 


instead of (122.4). 

The expression (118.8) was obtained for the correlation function of quasi- 
stationary fluctuations of a single quantity. An elementary integration gives 
as its spectral resolution 

1 1 1 2A 
(x? )eo a B le ‘T| = Blo? + 22) . (122.9) 
In accordance with the physical significance of the approximation, which 
corresponds to quasi-stationary fluctuations, this expression is valid only for 
frequencies small compared with the reciprocal of the time for partial 
equilibrium to be established. 

In terms of the random force y(t) defined at the end of § 118, the time 
dependence of the fluctuating quantity x is described by the equation 
x = —Ax-+y. Multiplying by e’” and integrating with respect to t from — © 
to co (the term xe’ being integrated by parts'), we obtain (A—iw)x, = Y,- 
Hence it is evident that we must put 


(Yo = (@* +47) (X7)o = 20/8. (122.10) 


This expression can also, of course, be obtained directly from (118.10). 
The presence of 4(t) in (118.10) corresponds to the fact that (y?),, is inde- 
pendent of @ in (122.10). 

The above formulae can be immediately generalised to the simultaneous 
fluctuations of several thermodynamic variables. The corresponding corre- 
lation functions ¢,(¢) have been determined in § 119. The components of 
their spectral resolution are 


(XiXk)o = f dultye dt 
= | (x(a) xx(0))e™ de, (122.11) 
and instead of (122.4) we have 
(XioX ke!) = 22(X)Xk)o 6(o +0’); (122.12) 


the order of the factors in (x;x,),, is significant. 


t The terms in x(+ ©) are to be omitted; they occur because, as mentioned 
above, the integrals (122.1) are actually divergent. Formally, they are still unim- 
portant in the calculation of the average (y,,y,..), since they are finite when w’ = —w 
and can be omitted in comparison with the principal (delta function) term. 
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A change in the sign of the time is equivalent to the change @ - —@ in 
the spectral resolution, which in turn implies taking the complex conjugate 
of (x,x;,).. The equation ¢$,(7) = ¢,(—?) (119.2) therefore shows that 


(X XK Jo = (XKXI)—o = (EX - (122.13) 


The symmetry of the fluctuations under time reversal, expressed by equation 
(119.3) or (119.4), is written in terms of the spectral resolution as 


(XiXp)o = (XK) —o = £OX)os (122.14) 


where the + and — signs respectively relate to cases where x, and x, behave 
similarly or differently under time reversal; in the former case, therefore, 
(x;X;)» is real and symmetrical in the suffixes i and k, while in the latter case 
it is imaginary and antisymmetrical. 

The equations (119.8) are satisfied by the correlation functions of quasi- 
stationary fluctuations. They are easily solved by means of the spectral 
resolution. 

Since the equations (119.8) refer only to times t > 0, we shall apply to 
them a “one-sided” Fourier transformation, multiplying by e and inte- 
grating with respect to ¢ from 0 to oo. The term e"¢,,(#) is integrated by parts; 
since (oo) = 0, we have 


— $(0)—io(xix)P = —AnOaxn?, 
with the notation 
(x)? = f dude’ de. (122.15) 
Q 


The value of ¢,,(0) is determined by the “initial condition” (119.9); hence 


An—iodx) (xx)&? = Brh, 
or 
(¢ ik —~ toBix) (xx) 5? = Oi, 


where the coefficients 4, have been replaced by the kinetic coefficients 
Cn = BuAg (see (120.13)), which are more convenient because of their sym- 
metry. The solution of these algebraic equations Is 


(xexN? = C—ioB)ia, 


where the index —1 denotes the inverse matrix. 

On the other hand, the desired components of the spectral resolution 
(122.11) are expressed in terms of the components of the “one-sided” 
expansion (122.15) by 


(xixRo = (erxOEP + (xnx)™ ; (122.16) 
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this is easily seen by writing the integral from — to o as a sum of two 
integrals from — co to 0 and from 0 to ©, replacing t by —t in the first, and 
using the symmetry property (119.2). Thus we have finally 


(1x1) = (C— iB) he + (6 + iB), fa. (122.17) 


By the symmetry properties of the matrices ¢,, and B,,, the quantities (122.17) 
necessarily have the properties (122.13) and (122.14).* 

The results obtained above can be put in a different form by using in the - 
relaxation equations “random forces” in the same way as was done at the 
end of § 118 for a single fluctuating quantity. The correlation properties 
of these forces take an especially simple form if they are brought into the 
equations written in terms of thermodynamically conjugate quantities, as 
in (120.5) or (120.13). For example, with random forces Y,, the equations 


(120.13) become 
Xi — Cx t Vi; (122.18) 


the quantities Y, may be neglected when the x, exceed their mean fluctuations. 
As in the derivation of (122.10), a simple calculation gives for the spectral 
resolution of the correlation functions of the random forces 


(ViY no = Cie tCei- (122.19) 


As in (122.10), these are independent of the frequency. 
If random forces y, are used in equation (120.5), 


Xi = —VUXn+Yis (122.20) 
their correlation function is given by the analogous formula 
(Wo = Vik t+ Vai (122.21) 


This is obvious without further calculations if we again use the reciprocity 
of the relation between the x, and the X; (see the second footnote to § 111). 
The advantage of formulae (122.19) and (122.21) is that they involve the 
elements of the matrices ¢,, and y, themselves, and not of their inverses.t 


t The matrix of the quantities 8, is always symmetrical. But, if certain x, and x, 
behave differently under time reversal, the corresponding By, = 0. This follows 
because £;, is the coefficient of the product x;x, in the quadratic form (111.1) 
which determines the change of entropy in a deviation from equilibrium. Since the 
entropy is invariant under time reversal, while the product x,x, changes sign, 
the entropy cannot contain any such term, and we must have fi, = 0. 

t Since the expressions (122.19) and (122.21) are independent of the frequency» 
it follows (as with (122.10) for a single fluctuating quantity) that the correlation 
functions (Y,(#) Y,(0)) and (y,() »;,(0)) themselves contain a delta function of time: 


LVAD ¥(0)) = (Ye +V eg) OO. (122.21a) 
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As an example of the use of the above formulae, let us consider fluctua- 
tions of a one-dimensional oscillator, i.e. a body which is at rest in the equi- 
librium position (Q = 0) but capable of executing small oscillations in some 
macroscopic coordinate Q. Because of fluctuations, the coordinate Q will 
in fact undergo deviations from the value Q = 0. The mean square of this 
deviation is determined directly from the coefficient in the quasi-elastic force 
which acts on the body during a deviation. 

We write the potential energy of the oscillator in the form U = tmw;0’, 
where m is the “mass” (i.e. the coefficient of proportionality between the 
generalised momentum P and the velocity Q : P = mQ), and wo the natural 
frequency of the oscillator (in the absence of friction). Then the mean square 
fluctuation is (cf. § 112, Problem 7) 


(Q%) = Time. (122.22) 
The spectral resolution of the fluctuations in the coordinate will be ob- 


tained for the general case where the oscillations are accompanied by friction. 
The equations of motion of an oscillator with friction are 


OQ = Pim, (122.23) 
P = —mo2Q—yP/m, (122.24) 
where —yP/m = —yQ is the frictional force. As shown in § 121, if Q and 
P are taken as x, and x, the corresponding X, and X, are mw?Q/T and 


P/mT. Equations (122.23) and (122.24) then represent the relations x, = 
—Y4X,, sO that 


yu=0, ypr=—-yu=—-T, yee = yT. 
In order to apply these equations to the fluctuations, we write (122.24) as 
P =—mw3Q—yP/m+y, (122.25) 
with the random force y on the right; equation (122.23), which is the defi- 
nition of the momentum, must be left unchanged. According to (122.21) 


we have immediately the spectral density of the fluctuations of the random 


force 
(Yo = 2yea = 2yT. (122.26) 


Finally, in order to derive the required (Q?),, we substitute P = mQ in 
(122.25), obtaining . 
m0 +7Q+mazQ = y. (122.27) 


Multiplying by e’ and integrating over time, we find 
(—mo?— iwy + m05)On = Yoo 


and hence finally 
(QO), = 2yT|[m?(w? — wo)? + o*y*], (122.28) 
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§ 123. The generalised susceptibility 


It is not possible to derive a general formula for the spectral distribution 
of arbitrary fluctuations analogous to formula (122.9) for the quasi- 
Stationary fluctuations, but in many cases it is possible to relate the 
properties of fluctuations to quantities describing the behaviour of the 
body under certain external interactions. These may be fluctuations of 
either classical or quantum quantities. 

Physical quantities of this type have the property that for each of them 
there exists an external interaction described by the presence, in the Hamil- 
tonian of the body, of a perturbing operator of the type 


P = ~sf(0), (123.1) 


where ¥ is the operator of the physical quantity concerned, and the perturb- 
ing generalised force f is a given function of time. . 

The quantum mean value X is not zero when such a perturbation is present 
(whereas X == 0 in the equilibrium state in the absence of the perturbation), 
and it can be written in the form &f, where & is a linear integral operator 
whose effect on the function f(¢) is given by a formula of the type 


#() = af = f a() f(t—n) dr, (123.2) 


a(t) being a function of time which depends on the properties of the body. 
The value of X at time ¢ can, of course, depend only on the values of the 
force f at previous (not subsequent) times; the expression (123.2) satisfies 
this requirement. The quantity X(?) is called the response of the system to the 
perturbation. 

Any perturbation depending on time can be reduced by means of a Fourier 
expansion to a set of monochromatic components with a time dependence 
et, Substituting in (123.2) f and x in the forms f,e7' and x,e', we 
find as the relation between the Fourier components of the force and the 


response 
Xm = a(w) fe» (123.3) 


where the function «(w) is given by 


a(w) = f a(nel dr. (123.4) 

0 
If this function is specified, the behaviour of the body under the perturbation 
in question is completely determined. We shall call‘ «(w) the generalised 


t For example, f may be an electric field and x the electric dipole moment 
acquired by the body in that field. Then « is the electric polarisability of the body. 
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susceptibility. It plays a fundamental part in the theory described below, since, 
as we shall see, the fluctuations of the quantity x can be expressed in terms 
of it.f 

The function «(w) is in general complex. Let its real and imaginary parts 
be denoted by «’ and «”: 


(a) = a’(w) + iax’’(@). (123.5) 
The definition (123.4) shows immediately that 
a(—w) = «*(@). (123.6) 
Separating the real and imaginary parts, we find 
a(—w) = «(w), «’(—w) = —a’(w), (123.7) 


ic. «(w) is an even function of the frequency, and «” an odd function. 
When w = 0 the function «”(w) changes sign, passing through zero (or in 
some cases through infinity). 

It should be emphasised that the property (123.6) simply expresses the 
fact that the response X must be real for any real force f. If the function / is 
purely monochromatic and is given by the real expression 


FO = re feet = 3 (frei +fitel™), (123.8) 
then by applying the operator « to each of the two terms we obtain 
¥ = F{a(o) foe + a( 0) fore]; (3-2) 


the condition for this expression to be real is just (123.6). 

As@ + co, the function «(w) tends to a real finite limit «,,. For definiteness 
we shall suppose below that this limit is zero; a non-zero «,, requires only 
some obvious slight changes in some of the formulae. 

The change in state of the body as a result of the “force” fis accompanied 
by absorption (dissipation) of energy; the source of this energy is the external 
interaction, and after absorption in the body it is converted into heat. This 
dissipation also can be expressed in terms of the quantity «. To do so, we use 
the equation dE/dt = 0H/0t, which states that the time derivative of the 
mean energy of the body is equal to the mean value of the partial derivative 
of the Hamiltonian of the body with respect to time (see§ 11). Since only the 
perturbation V in the Hamiltonian depends explicitly on the time, we have 


dE/dt = —x df/dz. (123.10) 


t The quantity «(w) thus defined is more convenient than the quantity Z(w) = 
—1/iwa(q), called the generalised impedance, which is the coefficient in the rela- 
tion f, = Z(@)xX,- 
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This relation is of importance in applications of the theory under discussion. 
If we know the expression for the change in energy in a particular process, 
a comparison with (123.10) will show which quantity is to be interpreted as 
the “force” f with respect to a given variable x. 

Substituting x and f from (123.8) and (123.9) in (123.10) and averaging 
over time, we find the mean energy Q dissipated per unit time in the system 
under a monochromatic perturbation. The terms containing e+™@* vanish 
on averaging, and we obtain‘ 


Q = hiw(a*—«) | fol? = twa’) | fol?. (123.11) 


From this we see that the imaginary part of the susceptibility determines the 
dissipation of energy. Since any actual process is always accompanied by 
some dissipation (Q > 0), we reach the important conclusion that, for all 
positive values of the variable w, the function «” is positive and not zero. 

It is possible to derive some very general relations concerning the function 
a(w) by using the methods of the theory of functions of a complex variable. 
We regard w as a complex variable (w = w’ +iw’’) and consider the properties 
of the function «(@) in the upper half of the w-plane. From the definition 
(123.4) and the fact that «(¢) is finite for all positive 1, it follows that «(w) 
is a one-valued regular function everywhere in the upper half-plane. For, 
when w” > 0, the integrand in (123.4) includes the exponentially decreasing 
factor e~'”” and, since the function «(¢) is finite throughout the range of 
integration, the integral converges. The function «() has no singularity on 
the real axis (w” = 0), except possibly at the origin.? It is useful to notice 
that the conclusion that «() is regular in the upper half-plane is, physically, 
a consequence of the causality principle. Owing to this principle, the inte- 
gration in (123.2) is taken only over times previous to t, and the range of 


T If the function f(s) is not purely monochromatic but the perturbation acts 
over a finite time (f-— 0 as|7| -> oo), the total energy dissipation in the whole 
time is expressed in terms of the Fourier component of the perturbation by the 
integral 

daw 
{ Qdt=— | iwa(w) fool” 


- | Deva’) fl. 


+ In the lower half-plane, the definition (123.4) is invalid, since the integral 
diverges. Hence the function «(@) can be defined in the lower half-plane only as 
the analytical continuation of the expression (123.4) from the upper half-plane, 
and in general has singularities in this region. 
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integration in (123.4) therefore extends from 0 to o rather than from 
—co tO oc, 


It is evident also from the definition (123.4) that 
a(—w"*) = a*(w). (123.12) 


This generalises the relation (123.6) for real w. In particular, for purely 
imaginary w we have a(iw”) = «*(iw”), i.e. the function «(w) is real on the 
imaginary axis. 

We shall prove the following theorem. The function «(w) does not take real 
values at any finite point in the upper half-plane except on the imaginary 
axis, where it decreases monotonically from a positive value a > 0 at 
tw = i0 to zero at w = i oo. Hence, in particular, it will follow that the func- 
tion «(w) has no zeros in the upper half-plane. 

To prove the theorem we use a theorem from the theory of functions of a 
complex variable, according to which the integral 


1 (daw) da 
aa | do a(w)—a’ ey) 


taken round some closed contour C, is equal to the difference between the 
number of zeros and the number of poles of the function «(w)—a in the region 
bounded by the contour. Let a be a real number and let C be taken as a 
contour consisting of the real axis and an infinite semicircle in the upper 
half-plane (Fig. 53). Let us first suppose that a» is finite. Since in the upper 


Oo 
) @ 
Cc 
-0 0 +00 0 Q Qo 
Fic. 53 


half-plane the function «(w) has no pole, the same is true of x(«) —a, and the 
integral in question gives simply the number of zeros of the difference «—a, 
ie. the number of points at which «(w) takes the real value a. 


T The proof given here is due to N. N. Meiman. 
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To calculate the integral, we write it as 


1 da 
Qn | a—a’ 
Fy! 

the integration being round a contour C” in the plane of the complex variable 
a which is the map of the contour C in the w-plane. The whole infinite semi- 
circle is mapped on to the point « = 0, and the origin (w = 0) is mapped on 
to another real point ao. The right and left halves of the real axis of w are 
mapped in the «-plane on to some very complicated (generally self-intersect- 
ing) curves which are entirely in the upper and lower half-planes respectively. 
It is important to note that these curves nowhere meet the real axis (except 
at a = 0 and « = a»), since « does not take real values for any real finite w 
except w = 0. Because of this property of the contour C’, the total change of 
the argument of the complex number «—a on passing round C’ is 2x (if a 
lies between 0 and a9 as shown in Fig. 53) or zero (if a lies outside that range), 
whatever the number of self-intersections of the contour. Hence it follows 
that the expression (123.13) is equal to 1 if0 < a < aq and zero for any other 
value of a. 

Thus we conclude that the function «(w) takes, in the upper half-plane of 
@, each real value of a in this range once only, and values outside this range 
not at all. Hence we can deduce first of all that on the imaginary axis, where 
the function «(w) is real, it cannot have either a maximum or a minimum, 
since otherwise it would take some values at least twice. Consequently, «(@) 
varies monotonically on the imaginary axis, taking on that axis, and nowhere 
else, all real values from «o to zero once only. 

If ao = 00 (i.e. a(w) has a pole at the point w = 0), the above proof is 
affected only in that on passing along the real axis (in the w-plane) it is 
necessary to avoid the origin by means of an infinitesimal semicircle above 
it. The change in the contour C’ in Fig. 53 can be regarded as the result of 
moving «o to infinity. The function «(w) then decreases monotonically from 
eo to 0 on the imaginary axis. 

Let us now derive a formula relating the real and imaginary parts of the 
function a(w). To do so, we choose some real positive value wo of w and 
integrate the expression «/(w—qwo) round the contour shown in Fig. 54. This 


c ) 
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contour includes the whole of the real axis, indented upwards at the point 
@® = Wo > 0(and also at the point w = Oif the latter is a pole of the function 
a(w)), and is completed by an infinite semicircle. At infinity, « > 0, and the 
function «/(w—o) therefore tends to zero more rapidly than 1/w. The 


integral 
| a(w) deo 
@O—Wo 


consequently converges; and since «(w) is regular in the upper half-plane, and 
the point @ = wo has been excluded from the region of integration, the func- 
tion «/(co—@o) is analytic everywhere inside the contour C, and the integral 
is therefore zero. 

The integral along the infinite semicircle is also zero. The point wo is 
avoided by means of an infinitesimal semicircle whose radius ¢ tends to zero. 
The direction of integration is clockwise, and the contribution to the integral 
is —ista(wo). If xo is finite, the indentation at the origin is unnecessary, and 
the integration along the whole real axis therefore gives 


@o--2 oo 
lim { = do+ | * __ deo \—ima(wo) = 0. 
e-0 @—Wo O—Wo 

oo Mo+e 


The first term is the principal value of the integral from — © to oo. Indicating 
this in the usual way, we have 


oo 


ima(wo) = P { 


snap (123.14) 


Here the variable of integration w takes only real values. We replace it by &, 
call the given real value w instead of wo, and write the function «(w) of the 
real variable w in the form « = «’ +ia’’. Taking the real and imaginary parts 
of (123.14), we obtain the following two formulae: 


a’(w) = =P | Co dé, (123.15) 
aw) = =P fo dé, (123.16) 


called the dispersion relations and first derived by H. A. Kramers and 
R. de L. Kronig (1927). It should be emphasised that the only essential 
property of the function «(w) used in the proof is that it is regular in the 
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upper half-plane. Hence we can say that Kramers and Kronig’s formulae, 
like this property of «(w), are a direct consequence of the causality principle. 
Using the fact that «’’(&) is an odd function, we can rewrite (123.15) as 


~ a’(E) 
E+a 


lof ’® «1 
a(o) = = P[ ars di+— P 


v 
0 


dé, 


0 
or 


a’(@) = 2p Sa") ag. (123.17) 


E2— ey? 
0 


If the function «(w) has a pole at the point w = 0, near which a = iA/a, 
the semicircle avoiding this pole gives a further real term —A/w, which must 
be added to the left-hand side of equation (123.14). Thus formula (123.16) 
becomes 


«’(w) = -ip | 26) a+“, (123.18) 


but (123.15) and (123.17) remain unchanged. 

We may also derive a formula which expresses the values of «(w) on the 
positive imaginary axis in terms of the values of «’’(@) on the real axis. To do. 
So, we Consider the integral 

coa(co) 
w* +2 


taken along a contour consisting of the real axis and an infinite semicircle 
in the upper half-plane (w» being a real number). This integral can be 
expressed in terms of the residue of the integrand at the pole w = iw o. The 
integral along the infinite semicircle is zero, and so we have 
wa(w) Sa Sgoaehe 
| coe dw = iza(iwo). 


On the left-hand side the real part of the integral is zero, since the integrand 
is an odd function. Replacing w by & and wp» by o, we have finally 


Zz ~ Ea’) 


m | w+ & 
0 


dé. (123.19) 


a(iw) = 


t The property « - 0 as w -> © is not essential: if the limit «, were other 
than zero, we should simply take «—«,, in place of «, with corresponding obvious. 
changes in formulae (123.15), (123.16). See also § 126, Problem. 
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Integration of this with respect to w gives 


[ a(io) do = f a”(w) do. (123.20) 


0 0 


§ 124. The fluctuation—dissipation theorem 


Let us now proceed to calculations which are designed to relate the fluctua- 
tions of a quantity x to the generalised susceptibility defined in § 123. 

Let a body to which the quantity x refers be in a particular (nth) stationary 
state. The mean value (122.8) is calculated as the corresponding diagonal 
matrix element 


L(R Ro + Ra Sadan = FY [Salam %er)nn+ (%eam (%emnly (124.1) 


where the summation is over the whole spectrum of energy levels; since the 
operator %,, is complex, the two terms in the brackets are not equal. 

The time dependence of the operator £(¢) means that its matrix elements 
must be calculated by means of the time-dependent wave functions. We 
therefore have 


(Xe)am = f Xnmeloamt ot dt = W1Xnm 6(Onm +O); (124.2) 
where x,,, is the ordinary time-independent matrix element of the operator &, 
expressed in terms of the coordinates of the particles of the body, and 


Qam = (E,—£E,,)/h is the frequency of the transition between the states n 
and m. Thus 


F(Rwkur + Swann = 2? Y | Xam|* [5(Onm +O) (Onn +O’) 
+ 6(Onm +0’) O(Omn+o)], 


where we have used the fact that Xm, = Xmas Since x is real. The products of 
delta functions in the brackets can clearly be written as 


6(Onm +@) 6(@ +0’) + O(@mn +) 6(@ +0’). 
A comparison with (122.8) then gives 
(x7) = 0D) | Xam |? [6( + Onm) + 5(O +0mn))- (124.3) 
The following comment may be made concerning the way in which this 
expression is written. Although the energy levels of a macroscopic body are, 


strictly speaking, discrete, they are so close together that in practice they 
form a continuous spectrum. Formula (124.3) may be written without the 
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delta functions if it is averaged over small frequency intervals (which never- 
theless contain many levels). If /(E) is the number of energy levels less than E, 


then 


(x?) = ah laa set ae 


dE, + dE.’ (124.4) 


where E,, = E, +ho, Ej, = E,—ho. 
Let us now assume that the body is subject to a periodic perturbation (with 
frequency w), described by the operator 


VP =f =—F(foe7™ +fdel)z. (124.5) 
Under the effect of the perturbation the system makes transitions, and the 
probability (per unit time) of the transition n > m is given by 


2 
Winn = mo | Xmn [2 {d(@ +0mn) + d(@ +nm)}; (124.6) 


see Quantum Mechanics, § 42. The two terms in this formula correspond to 
those in (124.5). In each transition the system absorbs or emits a quantum 
AOmy: The sum 

Q= 2 Wnt mn 


is the mean energy absorbed by the body per unit time; this energy is supplied 


by the external perturbation, and after absorption in the body it is dissipated 
there. Substitution of (124.6) gives 


O = Fel Fol? Yl xnml? {5(0 +e>mx) + (0 + Onn) } mn 
or, since the delta functions are zero except when their argument is zero, 
Q= 5 Fol? S| Xnm ? {5(@-+ Onn) —H(@+Omr)}- (124.7) 
Comparison of (124.7) and (123.11) gives 
(0) = FY Xaml?{(O +O pm) —B(O +O nn}. (124.8) 
The quantities (x*),, and «’” thus calculated are related in a simple manner 
but the relation appears only when these quantities are expressed in terms of 


the temperature of the body. To do this, we average by means of the Gibbs 
distribution (cf. the first footnote to§ 118). For (x?),, we have 


(x") = 7 nD On|Xnm |? {d(@ +@nm) =f d(@ +Wmn)} 


26* 
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where for brevity we have put 9, = e“—*"", E, denoting the energy levels 
of the body and F its free energy. Since the summation is now over both 
suffixes m and n, these can be interchanged. If this is done in the second term, 
we obtain 

(x*),. = 7 » (On + Om) | Xnm |? 6(@ + Onm) 


= mY. On(1 term!) | Xnm|® 8(0 + Onm) 


or, because of the delta function in the summand, 


(x2)y = 01 + EFT) Sn | Xnm|? 6(O + Onm): 
In an exactly similar manner we obtain 
a” = x (1—eFIT) Sn | Xnm |? 6(@ + On) 


A comparison of these two expressions gives 


i ho _, {1 1 
(x?) = fa’ coth a = 2ha {> + geri} (124.9) 


The mean square of the fluctuating quantity itself is given by the integral 


h hw 
2. Zé 
x) = = | a”’(w) coth ya do. (124.10) 


These important formulae constitute the fluctuation-dissipation theorem 
(FDT), established by H. B. Callen and T. A. Welton (1951). They relate 
the fluctuations of physical quantities to the dissipative properties of the 
system when it is subject to an external interaction. It should be noted that 
the factor in the braces in (124.9) is the mean energy (in units of #w) of an 
oscillator at temperature T; the term 4 corresponds to the zero-point oscil- 
lations. 

As at the end of § 118 the results obtained above can be written in a differ- 
ent form by formally regarding the spontaneous fluctuations of the quantity 
x as due to the action of some fictitious random forces. It is convenient to 
write the formulae in terms of the Fourier components x,, and /,, as if x 
were a classical quantity. The relation between them is 


Xo = ao) fo, (124.11) 


similarly to (123.3), and the mean square fluctuations can then be written 


in the form 
(XwXeo') = a(n) oc’) (for fee’)s 
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or with the spectral densities of fluctuations, by the definition (122.4): 
(x?),, = a(@) «(—@) (fF) 
= |a(o)|?(f?)o- 
The spectral density of the mean square of the random force is accordingly, 
from (124.9), 


_ hie’ (w) ho 
H° = Tafaye OP Br” 


(124.12) 
This treatment may offer certain advantages in particular applications of the 
theory. 

The derivation of the FDT is based on regarding the external interaction 
(124.5) as a small perturbation; this also ensures that the response of the 
system is linear (there is a linear relation between X and the force f). It 
should be emphasised, however, that this does not imply any physical re- 
strictions on the permissible values of the mean fluctuation of x itself. The 
smallness of the interaction can always be achieved by making sufficiently 
small the auxiliary quantity f, which does not appear in the final formulation 
of the FDT. Thus, for the class of physical quantities x under consideration, 
the properties of their fluctuations (in a system in thermodynamic equilib- 
rium) are entirely determined by the response of the system to an infinitesi- 
mal external interaction. 

At temperatures T >> fw we have coth(fw/2T) ~ 2T/fw, and formula 
(124.9) becomes 

(x*),. = (2T/w) «’’(w). (124.13) 
The constant # no longer appears, in accordance with the fact that under 
these conditions the fluctuations are classical. 

If the inequality J >> he is valid for all frequencies of importance (those 
for which «”(w) is significantly different from zero), then we can take the 
classical limit in the integral formula (124.10) also: 


(=F | am 


“1 4 


But from (123.17) this integral can be expressed in terms of the static value 
«’(0) = «(0), and hencet 

(x?) = Ta(0). (124.14) 

t This expression can also be derived directly from the Gibbs distribution in 


classical statistics. Let x = x(q, p) be a classical quantity, With a term —xf 
(f constant) in the energy of the system, the mean value of x is 


F—E(q, ; 
dex [ pee (q meee p)f ! dq dp. 


By definition, <(0) = d%/df when f -— 0; differentiating the above expression, we 
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Lastly, let us consider the relationship between these results and the theory 
of quasi-stationary fluctuations (§ 118). First of all, we note that, if x is such 
that its fluctuations are small in the sense implied in § 110 (i.e. the expansion 
(110.3) of the entropy is permissible), the mean square (x?) = 1/8. A compar- 
ison with (124.14) shows that for such a quantity 


a(0) = 1/8T. (124.15) 


Next, let x be a quantity of the class whose fluctuations are quasi-station- 
ary. We assume that the body is subject to the action of a static force 
This causes a displacement of the equilibrium state, in which X is now not 
zero but «(0)f = f/B7. The macroscopic equation of the relaxation of a sys- 
tem far from equilibrium will then have the form 


& = —Mx—f/pr), (124.16) 


which differs from x = —Ax (118.5) in that X is zero not when x = 0 but 
when x = f/AT. 

Equation (124.16) may be regarded as valid also when the body is subject 
to the action of a time-dependent perturbation, provided that the period of 
variation of the force /(7) is large compared with the time for the establish- 
ment of the partial equilibrium corresponding to any given value of x. If 
S(t) is a periodic function of time with frequency w, the macroscopic value 
of x(¢) will vary with the same frequency. Substituting in equation (124.16) 
f(t) and x(t) in the form (123.8), (123.9) and separating the terms in e~!” 
and e', we obtain 


—twa(o) fo = —Aa(w)fo+ (A/BT fo, 
whence 
a(o) = A/BT(A—io). (124.17) 


According to the FDT (124.9), we then have 


2A hw fiw 
Ce a ik eee Lore 
eo = Bop +o) oO" oF 


(124.18) 


find 
1 F-E 1 
(0) = 7|* exp es dq dp = ps 


the free energy F also depends on f, but the term in the derivative OF/0f disappears 
when we put f= 0, ie. ¥ = 0. 
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This result generalises formula (122.9), which relates to fluctuations of a 
classical quantity. The expression (124.18) differs from (122.9) by a factor 


MDS fee. (124.19) 


2T 2T 


which tends to unity in the classical limit (Aw « T). 

Equation (124.16) can also be viewed in another way: not as the macro- 
scopic equation of motion of a system far from equilibrium (under an ex- 
ternal interaction), but as the equation for the fluctuations of the quantity 
x(t) in an equilibrium closed system under the action of the random force 
f. In this interpretation, it corresponds to equation (118.9), and the two 
definitions of the random force differ only by a factor: y = Af/TB. The 
spectral density (y?),, is found by substituting (124.17) in (124.12): 


21 fiw hw 
Oo = Fp Oth oF» 


(124.20) 
which differs from the previous expression (122.10) by the same factor 
(124.19). 


§ 125. The fluctuation—dissipation theorem for more than one variable 


The FDT can easily be generalised to the case where several fluctuating 
quantities x, are considered simultaneously. The generalised susceptibilities 
in such a case are determined from the response of the system to a pertur- 
bation having the form 


P = —Z,f()), (125.1) 


and are the coefficients in the linear relation between the Fourier components 
of the mean values x,(4) and the generalised forces /,(t): 


Xion = Lik) fro « (125.2) 


The change in the energy of the system is expressed in terms of the external 
perturbation by 


E=-fx. (125.3) 


This formula, like (123.10), is generally used in particular applications of the 
theory to establish the actual relationship between the quantities x, and /,. 

The spectral densities of fluctuations are defined in terms of the mean 
values of the symmetrised operator products: 


é 


3 RiReat + Lear kin) = 2 XiXp)y 6(O +0’), (125.4) 
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which generalise the expression (122.8). A calculation of this mean value as 
the diagonal (nn) matrix element, as in the derivation of (124.3), gives the 
result 

(XiXn)o = % py {(Xi)nm (Xk)nn 1(@ +Onm) 


+ (x)am (Xi) mn 8(@ + ©mn)}- (125.5) 
Let a periodic perturbation with 
Fi) = Foie + foie) (125.6) 
act on the system. The response is 
X(t) = Florin(@) fore!" + ah w) foe]. (125.7) 


Substituting these two equations in (125.3) and averaging over the period 
of the perturbation, we obtain instead of (123.11) the following expression 
for the energy dissipation: 


Q = loo ae— ani) forsee - (125.8) 
On the other hand, a calculation similar to the derivation of (124.7) gives 


Q = oa @ > foifoid i) mn (Xn) nm d(@ +@nm) 


a (Xi) am (%k)mn d(@ + Omn)]> 


and a comparison with (125.8) shows that 


oo 
Cie — OK) = = Y [xD mn (Xinm 8(@ + Onn) 


(Dam (Xt) mn (+0 mn). (125.9) 
Finally, averaging (125.5) and (125.9) over the Gibbs distribution as in 

§ 124, we find the following generalisation of the FDT (124.9): 
(XiXio = S$th(ats—cci,) coth (he /27). (125.10) 


As in formulae (124.11) and (124.12), we can express (125.10) in terms of 
fictitious random forces whose action would give a result equivalent to 
the spontaneous fluctuations of the x, To do so, we write 


Xiw = Citfiwos Siw = © ieXkew (125.11) 
and 


(fifo = 0H tem X1Xm)o 
Substitution of (125.10) gives 
(fife = 4ih(a ia if) coth (fieo/27). (125.12) 
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From these results we can draw some conclusions concerning the sym- 
metry properties of the generalised susceptibilities «,,(@) (H. B. Callen, M. L. 
Barasch, J.L. Jackson and R.F. Greene, 1952). Let us first suppose that the 
quantities x, and x, are invariant under time reversal; then their operators 
x, and &, are real. We shall also suppose that the body has no magnetic 
structure (see the first footnote to § 128) and is not in an external magnetic 
field; then the wave functions of its stationary states are also real,’ and con- 
sequently so are the matrix elements of the quantities x. Since the matrices. 
Xam ate Hermitian, we have Xm = Xmn = Xm Then the right-hand side 
of (125.9), and consequently the left-hand side, are symmetrical in the suffixes. 
iand k. Hence a, —a%) = %jy—%y, OT ay +0, = %yj tay), Le. we conclude 
that the real part of «4, is symmetric. 

But the real and imaginary parts (a,, and «;,) of each a, are related by 
linear integral equations, namely Kramers and Kronig’s formulae. Hence 
the symmetry of a, implies that of a, and therefore that of «, itself. 
Our final result is therefore 


yx () = opi). (125.13) 


These relationships are somewhat modified if the body is in an external 
magnetic field H. The wave functions of a system in a magnetic field are not 
real, but have the property y*(H) = y(—H). Accordingly the matrix ele- 
ments of the quantities x are such that X%sn(H) = %mn(—H), and the 
expression on the right of (125.9) is unchanged, when the suffixes i and k are 
transposed, only if the sign of H is simultaneously changed. We therefore 
obtain the relation 


oH) — oC) = of:(—H)—ci(—H). 


Another relation is given by Kramers and Kronig’s formula (123.14), 
according to which 
Chi = iS (axi), 


where J is a real linear operator. Adding this to the Hermitian conjugate 
equation «;, = —iJ(a,), we obtain 


ik FOR = —if (ix —OKi)3 


here all the x, are, of course, taken for a fixed value of H. Hence we see that, 
if the difference «,—a,, has a particular symmetry property, then so has. 


t The exact energy levels of a system of interacting particles can be degenerate 
only with respect to the directions of the total angular momentum of the system. 
This source of degeneracy can be eliminated by assuming the body to be enclosed 
in a vessel with immovable walls. The energy levels of the body will not then be 
degenerate, and so the corresponding exact wave functions can be taken as real. 


392 Fluctuations 
the sum «;, +a,;, and therefore «,, itself. Thus 
ao; H) = ono; —H). (125.14) 


Finally, let the quantities x include some which change sign under time 
reversal. The operator corresponding to such a quantity is purely imaginary, 
and SO Xn = Xmn =—Xmn- If the two quantities x,, x, are both of this 
kind, the derivation of (125.13) is unaffected, but if only one of them changes 
sign under time reversal, the right-hand side of equation (125.9) changes 
sign when the suffixes i, k are interchanged. Accordingly (125.13) becomes 


a () = —ax;(@). (125.15) 
For a body in a magnetic field we have 
in; H) = — Ki (3 —H). (125.16) 


All these relations can, of course, be derived also from formula (125.10) 
as a consequence of the symmetry of the fluctuations with respect ot time. For 
example, if the two quantities x, and x, behave in the same way under time 
reversal, then the symmetry mentioned implies that (x,x,),, is real and sym- 
metric in the suffixes 7 and k (see§ 122). Then the right-hand side of (125.10) 
must also be symmetric in these suffixes, and we return to the result (125.13). 
This derivation of the symmetry properties of the generalised susceptibilities 
is analogous to that of the principle of the symmetry of the kinetic coefficients 
in § 120; we shall see below that formulae (125.13)— (125.16) may beregard- 
ed as a generalisation of that principle. 

The relation between the generalised susceptibilities and the kinetic co- 
efficients is shown by comparing the FDT with the theory of quasi- 
stationary fluctuations of several variables. We shall give the relevant for- 
mulae without reiterating all the arguments similar to those at the end of 
§ 124 for the case of a single variable. 

The static values of the susceptibilities are related to the entropy expan- 
sion coefficients 6, by 

Tr azx(0) = Bik ‘ 


The displacement of the equilibrium state when static forces f, act on the 
system is therefore given by 
Xi = a0) fe = Bo felT, Xi = Bux = filT. 


The macroscopic equations of motion of a non-equilibrium system under 
the action of quasi-static forces f,(f) may be written 


Xi = —yir(Xe—fxlT), (125.17) 
which differs from (120.5) in that X, is replaced by X,—/;,/T. 
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Substituting in (125.17) x,(4) and 7,(4) as the periodic functions (125.6), 
(125.7) (with the X, written as the linear combinations X, = B,%)), we 
obtain 


: 1 
ra iwocimfom Se Vik Bkitimfom ie T Vimfom > 
whence, since the f,, are arbitrary, we have the relations 


: 1 
—1OKim + ViePt%im = T Vim 


or 
si aise) 95.18 
ik = a (B lay ik. (1 5.18) 


These are the required relations between the «,, and the kinetic coefficients 
Vik 

The quantities £,, are by definition symmetric with respect to their suffixes 
(being the derivatives —02S/0x, 0x,). Hence the symmetry of the «,, implies 
that of the y,, i.e. the ordinary principle of the symmetry of the kinetic 
coefficients. 

Regarding the f, in equations (125.17) as random forces, and substituting 
(125.18) in (125.12), we obtain 


hoT . _ 2 h 
Uifde = >-( ik ty iu) coth =. 


If the random forces y, are defined as in (122.20), then y, = y,,f,/T, and their 
spectral distribution is 


h h 
Ono = ep +i) coth = : (125.19) 


This relation differs from formula (122.21) by the same factor (124.19), 
which tends to unity in the classical limit. 


§ 126. The operator form of the generalised susceptibility 


The FDT can also be regarded from the opposite side, taking equation 
(124.9) from right to left and writing (x?),, explicitly as the Fourier compo- 
nent of the correlation function: 


oo 


oo) = a tanh ~ { elt(Z(0) #(1) + 2(1) 2(0)) dt. (126.1) 
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In this form, the equation in principle allows the function «’(@) to be calcu- 
lated from the microscopic properties of the system. Its disadvantage is that 
it gives directly only the imaginary part of «(w), not the whole function. 
A similar formula can be obtained that does not suffer from this disadvan- 
tage. To derive it, we make a direct quantum-mechanical calculation of the 
mean value x in the perturbed system, with the perturbation operator (124.5)." 

Let Y® be the wave functions of the unperturbed system. Following the 
general method (see Quantum Mechanics, § 40), we seek the wave functions 
of the perturbed system in the first approximation as 


Fy = POLY Gan PO, (126.2) 


where the coefficients a,,,, satisfy the equations 


damn 
dt 


= V mneiennt — + Xmnelomn'( foe iat +foeia), 


ih 
In solving this, we must assume that the perturbation is “adiabatically” 
applied until the time ¢ from t = — (cf. Quantum Mechanics, § 43); this 
means that we must put @ + w+i0 in the factors et'* (where i0 denotes 
id with 6 - +0). Then 


an 


—iwt ieot 
elma a aw fe ee De | (126.3) 


1 
= Rm Omn—@ — 10 Omn +o +10 |” 

With the function Y, thus found, we calculate the mean value ¥ as the 
corresponding diagonal matrix element of the operator £. In the same ap- 
proximation, 


> — [ vse, dg 
d 


= ry (QinXnmelnm' + GnnX mn! mnt) 


1 
= Sa Sa Ss ee —iwt 
2h Xam n—@ —10 + soterm fe 
-+complex conjugate. 


Comparison of this result with the definition (123.9) shows that 


a(w) = >) | Xmen |? eee ees ' (126.4) 


+ This is more direct than using Kramers and Kronig’s relations to find «’(w), 
and thence a(@), from«’’(q). 
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The real and imaginary parts of this expression can be separated by means 
of the formula 
1 
x+i0 


= P— Finda); (126.5) 


see Quantum Mechanics, (43.10). For «’’(@) we recover, of course, the previ- 
ous result (124.8). 

It is easy to see that the expression (126.4) is the Fourier transform of the 
function 


t< 


a(t) = | ae (&(1) (0) —-XO) KY), tt > O~ (126.6) 
(as with the correlation function, this mean value depends, of course, only 
on the difference between the times at which the two operators %(f) are 
taken). For, calculating the function (126.6) as the diagonal matrix element 
with respect to the nth stationary state of the (unperturbed) system, we have 
fort >0 


28) = $Y Lenml#)xnn(0)—Hun(0) Xn) 


= $ 2 eee \2 [elmnt — gimnl], 


where the change to time-independent matrix elements is made by the usual 
rule: 

Xnmlt) = Xnmelnmt , 
Since the function «(f) 1s zero except when ¢ > 0, its Fourier transform is 
calculated ast 


; i 
eiot d¢ = oxi’ (126.7) 
| 
and is the same as (126.4). 
Thus we have as the final result 
a(@) = | elt R(t) 2(0) — X(0) (0) dz (126.8) 


(R. Kubo, 1956). Being valid for averaging over any given stationary state 
of the system, this formula also remains unchanged after averaging over the 
Gibbs distribution. 


t The integral is calculated by deflecting the contour of integration (in the com- 
plex t-plane) upwards or downwards according to the sign of a, i.e. replacing ¢ by 
1(1+26 sgn w) and then making 6 + -++0. 
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The generalised susceptibilities «,,(@) which determine the response of the 
system to a perturbation affecting several quantities x, are given by a similar 
formula, 


orix() = _ [ elor( (0) x4(0)— x1(0) x4(0)) dt. (126.9) 
3 


PROBLEM 
Determine the asymptotic form of «(@) for @ -> , if «(co) = 0. 


SOLUTION. When @ -»+ co, small values of ¢ are important in (126.8). Putting 
(4) = 2(0)+74(0), we find 


a(o) & + (et 28) | te™ de, 


the uniform argument t = 0 of the operators being omitted. The integral is calcu- 
lated by differentiating (126.7) with respect to w, and gives 


a(w) = — a3 (R— KZ); (J) 


this formula is valid if the mean value of the commutator is other than zero. 
The expression (1), being an even function of «, is real, so that it is the asymptotic 
form of «’(@). From (123.15) we have with @ -+ oo 


a'(w) =— a | Ea’’(E) dé, 
0 


using the fact that «’’(&) is odd. Comparison with (1) gives the following “sum 
rule” for«’’(@); 


| wa"(w) do = “a (42— 22), (2) 


§ 127. Fluctuations in the curvature of long molecules 


In ordinary molecules, the strong interaction between atoms reduces the 
thermal motion within molecules to small oscillations of the atoms about 
their equilibrium positions, which have practically no effect on the shape of 
the molecule. Molecules consisting of very long chains of atoms (e.g. long 
polymer hydrocarbon chains) behave quite differently, however. The great 
length of the molecule, together with the relative weakness of the forces tend- 
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ing to preserve the equilibrium straight shape of the molecule, means that 
the fluctuation curvature of the molecule may become very large and even 
cause the molecule to coil up. The great length of the molecule enables us to 
consider it as a kind of macroscopic linear system, and statistical methods 
may be used in order to calculate the mean values of quantities describing its 
curvature (S. E. Bresler and Ya. I. Frenkel’, 1939).f 

We shall consider molecules having a uniform structure along their length. 
Being concerned only with their shape, we can regard such molecules as 
uniform continuous threads. The shape of a thread is defined by specifying 
at each point in it the “curvature vector” 9, which is along the principal 
normal to the curve and equal in magnitude to the reciprocal of the radius 
of curvature. 

The curvature of the molecule is in general small at each point; since the 
molecule is of great length this does not, of course, exclude very large relative 
movements of distant points of it. For small values of the vector 9, the free 
energy per unit length of the curved molecule can be expanded in powers of 
the components of this vector. Since the free energy is a minimum in the equi- 
librium position (the straight shape, with @ = 0 at every point) linear terms 
do not appear in the expansion, and we have 


F= Fy ted Ai 00k » (127.1) 


where the values of the coefficients a,, represent the properties of the straight 
molecule (its resistance to curvature) and are constant along its length, 
since the molecule is assumed homogeneous. 

The vector g is in the plane normal to the line of the molecule at the point 
considered, and has two independent components in that plane. Accordingly, 
the set of constants a,, forms a symmetrical tensor of dimension two and 
rank two in this plane. We refer this to its principal axes, and denote its prin- 
cipal values by a1, a2; the thread which represents the molecule need not be 
axially symmetrical in its properties, and so a; and dz need not be equal. The 
expression (127.1) then becomes 


F = Fy +3(a03 + 4,03), 


where 0; and 02 are the components of 0 in the direction of the corresponding 
principal axes. 


t In the theory given here, the molecule is regarded as an isolated system, its 
interaction with neighbouring molecules being neglected. In a solid or liquid 
substance this interaction may, of course, have a considerable effect on the shape 
of the molecules. Although the applicability of the results to actual substances is 
therefore very limited, their derivation is of considerable methodological! interest. 
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Finally, integrating over the whole length of the molecule, we find the total 
change in its free energy due to a slight curvature: 


AF,= 4 f (@.e2 + 4,03) di, (127.2) 


where / is a coordinate along the thread. It is clear that a, and ag are neces- 
sarily positive. 

Let t, and t, be unit vectors along the tangents at two points a and b on the 
thread separated by a section of length /, and let 6 = O(/J) denote the angle 
between these tangents, i.e. t,t, = cos 4. 

Let us first consider a curvature so slight that the angle 0 is small even for 
distant points. We draw two planes through the vector t, and the two princi- 
pal axes of the tensor a, in the normal plane (at the point a). For small 6, 
the square of this angle may be written 


6? = 62463, (127.3) 


where 6, and 6, are the angles of rotation of the vector t, relative to t, in 
these two planes. The components of the curvature vector are related to the 
functions 63(J) and 62(D) by e1 = d61()/dl, 92 = d62()/d/, and the change in 
the free energy due to the curvature of the molecule may be written 


“1 d0,\2 _ /d6a\2 
AF, =5 | |( ar) +a Sr) | dl. (127.4) 


Incalculating the probability of a fluctuation with given values of 0:(J) = 01 
and 62(/) = 62 for a particular /, we must consider the most complete equilib- 
rium possible for given 9; and 6. (see the first footnote to§ 111). That is, we 
must determine the minimum possible value of the free energy for given 01 
and 6,. An integral of the form 


I 
d6i\2 
(a) . 
3 
for given values of the function 6;(/) at both limits (0:(0) = 0, 01(/) = 91) has 
a minimum value if 61(J) varies linearly. Then 


a0} | 03 


ae) uaa Te 


the fluctuation probability w oc e~4*/T (see (116.7)), and so we obtain for 
the mean squares of the two angles 


(63) = IT/a, (03) = IT/ay. 
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The mean square of the angle 0(/) under consideration is 


1 1 
(62) = IT (ata) (127.5) 
In this approximation it is, as we should expect, proportional to the length of 
the section of the molecule between the two points. 

Curvature with large values of the angles 0(/) may now be treated as fol- 
lows. The angles between the tangents t,, t,, t, at three points a, b, c on the 
thread are related by the trigonometrical formula 


cos 0,,¢ = COS 4,4 COS O4-—sin 9,5 sin O4- cos ¢, 


where ¢ is the angle between the planes (t,, t,) and (t,, t,). Averaging this for- 
mula and bearing in mind that, in the approximation considered, the fluctua- 
tions of curvature of the sections ab and bc of the molecule (for a given direc- 
tion of the tangent t, at the middle point) are statistically independent, we 
obtain 

(CoS Fac) = (COS Og4 COS My) 


= (COS 445) (COS Ipc); 


the term containing cos @¢ gives zero on averaging. 

This relation shows that the mean value (cos 0(/)) must be a multiplicative 
function of the length / of the section of the molecule between two given 
points. But for small 9(/) we must have, according to (127.5), 


(cos (J) = 1—4(6") = 1—IT/a, 


with the notation 2/a = 1/a,;+1/az. The function which satisfies both these 


conditions is 
(cos 0) = e7!Ti, (127.6) 


and this is the required formula. For large distances /, the mean value 
(cos 6) = 0, in accordance with the statistical independence of the directions 
of sufficiently distant parts of the molecule. 

By means of formula (127.6) it is easy to determine the mean square of the 
distance R (measured in a straight line) between the two ends of the molecule. 
If t(/) is a unit vector along the tangent at an arbitrary point in the molecule, 
the radius vector between its ends is 


L 
= J t(D di, 
0 


where L is the total length of the molecule. Writing the square of this integral 
as a double integral and averaging, we obtain 


LL LL 
(R®) = [ f t()-th) dh dla = ff e-Tlh-hle diy dis. 
0 0 0 0 
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The calculation of the integral gives the final formula 
a — 2( 2) (eE_ 4 e-11 0 
(R2) 2(F) ( ——1+e-£Tie), (127.7) 
For low temperatures (LT <« a) this becomes 
(R?) = LA(1—LT/3a); (127.8) 


as T + 0 the mean square (R?) tends to the square of the total length of the 
molecule, as it should. If LT >> a (high temperatures or great lengths L), 


(R®) = 2La/T. (127.9) 


Then (R*) is proportional to the length of the molecule, and as L increases 
the ratio (R?)/L? tends to zero. 


CHAPTER XIII 


THE SYMMETRY OF CRYSTALS 


§ 128. Symmetry elements of a crystal lattice 


THE most usual properties of symmetry of macroscopic bodies relate to the 
symmetry of the configuration of particles in them. 

Atoms and molecules in motion do not occupy precisely defined places in 
a body, and for an exact statistical description of their arrangement we must 
use a density function 0(x, y, z), which gives the probability of various con- 
figurations of the particles: 9 dV is the probability that an individual particle 
is in the volume element dV. The symmetry properties of the configuration 
of the particles are determined by the coordinate transformations (transla- 
tions, rotations and reflections) which leave the function (x, y, z) invariant. 
The set of all such symmerry transformations for a given body forms what is 
called its symmetry group. 

If the body consists of different kinds of atom, the function 0 must be deter- 
mined for each kind of atom separately; this, however, is unimportant here, 
since all these functions in an actual body will in practice possess the same 
symmetry. We could also use the function o defined as the total electron 
density due to all the atoms at each point in the body.* 

The highest symmetry is that of isotropic bodies (bodies whose properties 
are the same in all directions), which include gases, liquids and amorphous 
solids. It is evident that in such a body all positions in space of any given par- 
ticle must be equally probable, i.e. we must have 9 = constant. 

In anisotropic crystalline solids, on the other hand, the density function is 
not simply a constant. In this case it is a triply periodic function (with periods 
equal to those of the crystal lattice) and has sharp maxima at the lattice 
points. Besides translational symmetry, the lattice (i.e. the function (x, y, z)) 
also has, in general, symmetry under certain rotations and reflections. The 


t Moving electrons can cause not only a mean charge density eg but also a mean 
current density j(x, y, z). Bodies in which there are non-zero currents are those 
having a “magnetic structure”, and the symmetry of the vector function j(x, y, z) 
’ determines the symmetry of that structure, This is discussed in Electrodynamics, 
§ 28. 
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lattice points which can be made to coincide by any symmetry transformation 
are said to be equivalent. 

Proceeding to study the symmetry of a crystal lattice, we must first of all 
ascertain which elements can contribute to this symmetry. 

The symmetry of a crystal lattice is based on its spatial periodicity, the 
property of being unchanged by a parallel displacement or translation through 
certain distances in certain directions;‘ translational symmetry will be further 
discussed in § 129. 

As well as translational symmetry, the lattice may also be symmetrical 
under certain rotations and reflections; the corresponding symmetry ele- 
ments (axes of symmetry, planes of symmetry, and rotary-reflection axes) 
are the same as those which can occur in symmetrical bodies of finite size (see 
Quantum Mechanics, § 91). 

In addition, however, crystal lattices can also possess symmetry elements 
consisting of combinations of parallel translations with rotations and reflec- 
tions. Let us first consider combinations of translations with the axes of sym- 
metry. The combination of an axis of symmetry with a translation in a direc- 
tion perpendicular to the axis does not give a new type of symmetry element. 
It is easy to see that a rotation through a certain angle followed by a transla- 
tion perpendicular to the axis is equivalent to a rotation through the same 
angle about an axis parallel to the first. The combination of a rotation about 
an axis and a translation along that axis leads to a new type of symmetry ele- 
ment, a screw axis. The lattice has a screw axis of order n if it is unchanged by 
rotation through an angle 27/n about the axis, accompanied by translation 
through a certain distance d along the axis. 

After n rotations, with accompanying translations, about a screw axis of 
order n, the lattice is simply shifted along the axis by a distance nd. Thus, 
when there is a screw axis, the lattice must certainly also have a simple 
periodicity along this axis with a period not exceeding nd. This means that 
screw axes of order n can be correlated only with translations through 
distances d = pa/n(p = 1,2, ...,n—1), where a is the smallest period of the 
lattice in the direction of the axis. For example, a screw axis of order 2 can 
be of only one type, the translation being through half a period; screw axes 
of order 3 can be correlated with translations by 4 or 2 period, and so on. 

Similarly, we can combine translations with planes of symmetry. Reflection 
in a plane together with translation in a direction perpendicular to the plane 
does not give a new type of symmetry element, since such a transformation 
is easily seen to be equivalent to a reflection in a plane parallel to the first. 
The combination of a reflection with a translation along a direction lying in 


* Here the crystal lattice must be regarded as infinite, ignoring the faces of 
the crystal. 
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the reflection plane leads to a new type of symmetry element, a glide-reflection 
plane or glide plane. The lattice has a glide-reflection plane if it is unchanged 
by a reflection in this plane, accompanied by a translation through a certain 
distance d in a certain direction lying in this plane. 

A twofold reflection in a glide-reflection plane amounts to a translation 
through a distance 2d. It is therefore clear that a lattice can have only glide- 
reflection planes such that the translation distance d = +a, where a is the 
smallest period of the lattice in the direction of the translation. 

The combination of rotary-reflection axes with translations does not lead 
to new types of symmetry element, since in this case any translation can be 
resolved into two parts, one perpendicular to the axis and the other parallel 
to it and therefore perpendicular to the reflection plane. Thus a rotary- 
reflection transformation followed by a translation is always equivalent to 
another rotary-reflection transformation about an axis parallel to the first. 


§ 129. The Bravais lattice 


The translational periods of a lattice can be represented by vectors a whose 
directions are those of the respective translations and whose magnitudes are 
equal to the distances concerned. The lattice has an infinity of different lat- 
tice vectors. These vectors are not all independent, however: one can always 
choose three basic lattice vectors (corresponding to the three dimensions of 
space) which do not lie in one plane, and then any other lattice vector can 
be represented as a sum of three vectors each an integral multiple of one of 
the basic vectors. If the basic vectors are denoted by a1, 2, 43, an arbitrary 
lattice vector a will be of the form 


a= nya, +nea2t nas, (129.1) 


where 71, Ne, M3 are any positive or negative integers or zero. 

The choice of the basic lattice vectors is not unique. On the contrary, they 
may be chosen in an infinity of ways. Let ai, a2, a3 be basic lattice vec- 
tors, and let us replace them by other vectors aj, a3, 83, defined by the formu- 
lae 


a; = 2 oii (i,k = 1, 2, 3), (129.2) 


where the «,, are some integers. If the new periods a; are also basic lattice vec- 
tors, then, in particular, the vectors a, must be expressible in terms of the a; 
as linear functions with integral coefficients; then any other lattice vector can 
also be expressed in terms of the a;. In other words, if we express the a, in 
terms of the a; in accordance with (129.2), the resulting formulae must be of 
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the type 
a; = Y Bua» 


with the 8, again integral. The determinant |8,,| is the reciprocal of the 
determinant |«,,|, and since both are integers it follows that the equation 


lan! =+1 (129.3) 


is a necessary and sufficient condition for the a; to be basic lattice vectors. 

Let us choose a lattice point and mark off from it three basic lattice vec- 
tors. The parallelepiped formed by the three vectors is called a unit cell of the 
lattice. The whole lattice can then be regarded as a regular assembly of such 
parallelepipeds. All the unit cells are, of course, identical in their properties; 
they have the same shape and size, and each contains the same number of 
atoms of each kind identically arranged. 

It is evident that identical atoms will be found at every vertex of every unit 
cell. All these vertices, therefore, are equivalent lattice points, and each can be 
brought to the position of any other by translation through a lattice vector. 
A set of all such equivalent points which can be brought into coincidence by a 
translation forms what is called a Bravais lattice of the crystal. This clearly 
does.not include every point of the crystal lattice; indeed, in general it does 
not even include all equivalent points, since the lattice may contain equiva- 
lent points which can be made to coincide only by transformations involving 
rotations or reflections. 

The Bravais lattice can be constructed by selecting any crystal lattice point 
and performing all possible translations. By taking initially some other point 
not in the first Bravais lattice we should obtain another Bravais lattice 
displaced relative to the first. It is therefore clear that the crystal lattice in 
general consists of several interpenetrating Bravais lattices, each correspond- 
ing to atoms of a particular type and position. All these lattices, regarded as 
sets of points (i.e. purely geometrically), are completely identical. 

Let us return now to the unit cells. Because the choice of the basic lattice 
vectors is arbitrary, that of the unit cell is also not unique. The unit cell can 
be constructed from any basic vectors. The resulting cells are, of course, of 
varying shapes, but their volumes are all equal. This is most simply seen as 
follows. It is clear from the above discussion that each unit cell contains one 
point belonging to each of the Bravais lattices that can be constructed in the 
crystal concerned. Consequently, the number of unit cells in a given volume 
is always equal to the number of atoms of a particular type and position, i.e. 
is independent of the choice of cell. The volume of each cell is therefore the 
same, and equal to the total volume divided by the number of cells. 
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§ 130. Crystal systems 


Let us now consider the possible types of symmetry of the Bravais lattices. 

First, we shall prove a general theorem concerning the symmetry of crystal 
lattices with respect to rotations. Let us see which axes of symmetry the 
lattice can have. Let A (Fig. 55) be a point of a Bravais lattice, lying on an 
axis of symmetry perpendicular to the plane of the diagram. If B is another 
point separated from A by one of the possible translations, a similar axis of 
symmetry must pass through B. 


Fia. 55 


Let us now perform a rotation through an angle @ = 2x/n about the axis 
through A, where n is the order of the axis. Then the point B and the axis 
through it will move to B’. Similarly, a rotation about B carries A into A’. 
From their construction, the points A’ and B’ belong to the same Bravais 
lattice, and so can be made to coincide by a translation. The distance A’B’ 
must therefore also be a translational period of the lattice. If a is the shortest 
period in the direction concerned, the distance A’B’ must therefore be equal 
to pa with p integral. It is seen from the figure that this gives 


a+2a sin (¢—$x) = a—2a cos d 
= pa, 
or cos ¢ = +(1—p). 

Since |cos @| « 1, p can be 3, 2, 1 or 0. These values correspond to 
= 2x/n with n = 2, 3, 4 or 6. Thus the crystal lattice can have axes of 
symmetry only of order 2, 3, 4 and 6. 

Let us now examine the possible types of symmetry of the Bravais lattice 
under rotations and reflections. These types of symmetry are called crystal 
systems, and each corresponds to a certain set of axes and planes of symmetry, 
i.e. iS a point group. 

It is easy to see that every point of a Bravais lattice is a centre of symmetry 
thereof. For to each atom in a Bravais lattice there corresponds another atom 
collinear with that atom and with the lattice point considered, and such that 
the two atoms are equidistant from this lattice point. If the centre of sym- 
metry is the only symmetry element of the Bravais lattice (apart from trans- 
lations), we have 
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1. The triclinic system. This system, the least symmetrical of all, corre- 
sponds to the point group C,. The points of a triclinic Bravais lattice lie at 
the vertices of equal parallelepipeds with edges of arbitrary lengths and arbi- 
trary angles between edges. Such a parallelepiped is shown in Fig. 56. 

The Bravais lattices are customarily denoted by special symbols; that of the 
triclinic system is denoted by I’,. 

2. The monoclinic system is next in degree of symmetry. Its symmetry ele- 
ments are a second-order axis and a plane of symmetry perpendicular to this 
axis, forming the point group C,,. This is the symmetry of a right parallel- 
epiped with a base of any shape. The Bravais lattice for this system can be 
constructed in two ways. In one, called the simple monoclinic Bravais lattice 
(I), the lattice points are at the vertices of right parallelepipeds with the ac 
face an arbitrary parallelogram (Fig. 56). In the other, the base-centred lat- 
tice (I), the lattice points are not only at the vertices but also at the centres 
of opposite rectangular faces of the parallelepipeds. 
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3. The orthorhombic system corresponds to the point group D,,. This is 
the symmetry of a rectangular parallelepiped with edges of any length. The 
system has four types of Bravais lattice. In the simple orthorhombic lattice 
(I’,), the lattice points are at the vertices of rectangular parallelepipeds. In 
the base-centred lattice (I”), there are in addition lattice points at the centres 
of two opposite faces of each parallelepiped. In the body-centred lattice (I), 
the points are at the vertices and centres of the parallelepipeds; finally, in the 
face-centred lattice (IY) ,the points are at the vertices and at the centre of each 
face. 

4. The tetragonal system represents the point group D,,; this is the sym- 
metry of a right square prism. The Bravais lattice for this system can be 
constructed in two ways, giving the simple and body-centred tetragonal 
Bravais lattices (I, and 3), whose points lie respectively at the vertices and at 
the vertices and centres of right square prisms. 

5. The rhombohedral or trigonal system corresponds to the point group 
D3,4: this is the symmetry of a rhombohedron (a solid formed from a cube 
by stretching or compressing it along a spatial diagonal). In the only Bravais 
lattice possible in this system (I°,,) the lattice points are at the vertices of 
rhombohedra. 

6. The hexagonal system corresponds to the point group D,,; this is the 
symmetry of a regular hexagonal prism. The Bravais lattice for this system 
(I’,) can be constructed in only one way; its lattice points are at the vertices 
of regular hexagonal prisms and at the centres of their hexagonal bases. It is 
useful to mention the following difference between the rhombohedral and 
hexagonal Bravais lattices. In both, the lattice points lie in planes perpendic- 
ular to the axis of order 3 or 6, and form a network of equilateral triangles; 
but in the hexagonal lattice the points are directly superimposed in successive 
such planes (in the direction of the C¢ axis); these planes are shown in plan in 
Fig. 57. In the rhombohedral lattice, on the other hand, the points in each 
plane lie above the centres of the triangles formed by the points in the pre- 
vious plane, as shown by the circles and crosses in Fig. 57. 

7. The cubic system corresponds to the point group O,; this is the symmetry 
of a cube. This system has three types of Bravais lattice: the simple cubic (I",), 
the body-centred cubic (I) and the face-centred cubic (IY). 

In the sequence of systems: triclinic, monoclinic, orthorhombic, tetragonal, 
cubic, each has higher symmetry than those which precede it, i.e. each con- 
tains all the symmetry elements which appear in the preceding ones. The 
rhombohedral system is similarly of higher symmetry than the monoclinic, 
while at the same time it is of lower symmetry than the cubic and hexagonal 
systems: its symmetry elements are present in both of the latter, which are the 
two systems of highest symmetry. 

We may also mention the following fact. It might appear at first sight 
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that further types of Bravais lattice beyond the fourteen listed above are 
possible. For instance, if we add to the simple tetragonal lattice a point at 
the centre of each opposite square base of the prisms, the lattice would again 
be of tetragonal symmetry. However, it is easy to see that this would not 
give a new Bravais lattice. For, on joining the points of such a lattice in the 
manner indicated in Fig. 58 by the broken lines, we see that the new lattice 
is again a simple tetragonal one. The same is easily found to be true in all 
similar cases. 


\ 
? \ / 
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The Bravais lattice parallelepipeds shown in Fig. 56 themselves have all 
the symmetry elements of the system to which they belong. However, it must 
be remembered that, for all the Bravais lattices except the simple ones, these 
parallelepipeds are not unit cells: the lattice vectors from which they are con- 
structed are not basic ones. As the basic lattice vectors in the face-centred 
Bravais lattices we can take the vectors from any vertex of the parallelepiped 
to the centres of the faces, in the body-centred lattices from a vertex to the 
centres of the parallelepipeds, and so on. Fig. 59 shows the unit cells for the 
cubic lattices I~ and I; these cells are rhombohedra and do not themselves 
possess all the symmetry elements of the cubic system. It is evident that the 
volume 0; of the face-centred Bravais parallelepiped is four times that of the 
unit cell: v, = 4v. The volumes of the body-centred and base-centred paral- 
lelepipeds are twice that of the unit cell: v, = 2v, v, = 22. 

In order to define completely the triclinic Bravais lattice, it is necessary to 
specify six quantities: the lengths of the edges of its parallelepipeds and the 
angles between the edges. In the monoclinic system four quantities are suffi- 
cient, since two of the angles between the edges are always right angles. Simi- 
larly, we easily find that the Bravais lattices of the various systems are defined 
by the following numbers of quantities (lengths of edges of parallelepipeds or 
angles between edges): triclinic 6, monoclinic 4, orthorhombic 3, tetrago- 
nal 2, rhombohedral 2, hexagonal 2, cubic 1. 
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§ 131. Crystal classes 


In many effects which may be called_macroscopic, a crystal behaves as a 
homogeneous and continuous body. The macroscopic properties of the crys- 
tal depend only on the direction considered in it. For example, the properties 
of the passage of light through a crystal depend only on the direction of the 
light ray; the thermal expansion of a crystal is in general different in different 
directions; finally, the elastic deformations of a crystal under various external 
forces also depend on direction. 

On the other hand, the symmetry of crystals brings about an equivalence of 
various directions in them. All macroscopic properties of a crystal will be 
exactly the same in such directions. We can therefore say that the macro- 
scopic properties of the crystal are determined by the symmetry of directions 
in it. For instance, if the crystal has a centre of symmetry, every direction in 
it will be equivalent to the opposite direction. 

Translational symmetry of the lattice does not lead to equivalence of direc- 
tions, since parallel displacements do not affect directions. For the same rea- 
son, the difference between screw axes and simple axes of symmetry, and 
between simple planes of symmetry and glide-reflection planes, does not 
affect the symmetry of directions. 

Thus the symmetry of directions, and therefore that of the macroscopic 
properties of the crystal, are determined by its axes and planes of symmetry, 
with screw axes and glide planes regarded as ordinary axes and planes. Such 
sets of symmetry elements are called crystal classes. 

As we already know, an actual crystal may be regarded as a set of several 
interpenetrating identical Bravais lattices. Because of this superposition of 
the Bravais lattices, the symmetry of an actual crystal is in general different 
from that of the corresponding Bravais lattice. 

In particular, the set of symmetry elements forming the class of a given 
crystal is in general different from its system. It is evident that the addition of 
further points to a Bravais lattice can only eliminate some of its axes or 
planes of symmetry, not introduce new ones. Thus the crystal class contains 
fewer (or at most the same number of) symmetry elements than the corre- 
sponding system, i.e. the set of axes and planes of symmetry of the Bravais 
lattice of the crystal in question. 

From this we can derive a method of finding all the classes belonging to a 
given system. To do so, we must find all the point groups which contain some 
or all of the symmetry elements of the system. It may happen, however, that a 
point group thus obtained comprises symmetry elements present in more 
than one system. For example, we have seen in§ 130 that all Bravais lattices 
have a centre of symmetry. The point group C, is therefore present in all 
systems. Nevertheless, the distribution of crystal classes among systems is 
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usually physically unique: each class must be assigned to the system of 
lowest symmetry among those which contain it. For example, the class C, 
must be assigned to the triclinic systém, which has no symmetry element 
except a centre of inversion. With this method of assigning the classes, a 
crystal having a certain Bravais lattice will never be placed in a class which 
could be constructed from a Bravais lattice of a system of lower symmetry— 
with one exception (see below). | 

The necessity of satisfying this condition is physically evident: it is phys- 
ically most improbable that the atoms in a crystal which belong to its Bravais 
lattice should be arranged more symmetrically than is required by the sym- 
metry of the crystal. Moreover, even if such a configuration were to occur 
by chance, any external perturbation, even a weak one (heating, for example), 
would be sufficient to destroy this configuration, since it is not imposed by 
the symmetry of the crystal. For instance, if a cubic Bravais lattice were to 
occur in a crystal belonging to a class for which the tetragonal system was 
sufficient, even a slight interaction would be capable of lengthening or short- 
ening one of the edges of the cubic cell, converting it into a right square 
prism. 

From this example we see the importance of the fact that the Bravais lattice 
of a system of higher symmetry can be converted to that of a system of lower 
symmetry by means of an arbitrarily small deformation. There is one excep- 
tional case, however, where such a transformation is not possible: a hexago- 
nal Bravais lattice cannot be converted by any infinitesimal deformation 
into the lattice of the rhombohedral system, which is of lower symmetry. For 
we see from Fig. 57 that, to transform the hexagonal into the rhombohedral 
lattice, it is necessary to move the lattice points in alternate layers by a finite 
amount from the vertices to the centres of the triangles. In consequence, all 
the classes of the rhombohedral system can be obtained with either a hexa- 
gonal or a rhombohedral Bravais lattice." 

Thus, to find all the crystal classes, we must first look for the point groups 
of the triclinic system, which has the lowest symmetry, and then go on in turn 
to systems of higher symmetry, omitting those of their point groups (i.e. 
classes) which have already been assigned to systems of lower symmetry. It is 
found that there are altogether 32 classes; a list of these arranged according 
to systems is as follows: 


System Classes 
Triclinic C, C, 
Monoclinic C... “Cy Cy 


t Crystals of rhombohedral classes with a hexagonal Bravais lattice are usually 
assigned to the rhombohedral system. 
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System Classes 
Orthorhombic Coy D Da, 
Tetragonal Sy» = Day Cy Cay 

Cu» D 49 D 4h 
Rhombohedral C3, So. C3, Ds, Dsa 
Hexagonal Cs» Day Cor Cons 
Ce De Der 
Cubic T; Tr, 1, O, O, 


In each of these sets of classes the last is the one of highest symmetry, and 
contains all the symmetry elements of the corresponding system. The classes 
whose symmetry is equal to that of the system are called holohedral classes. 
Those whose number of different symmetry transformations (rotations and 
reflections, including the identical transformation), is less than for a holo- 
hedral class by a factor of two or four are called hemihedral and tetartohedral 
classes respectively. For example, in the cubic system the class O, is holo- 
hedral, O, T,, and 7, are hemihedral, and TJ is tetartohedral. 


§ 132, Space groups 


Having studied the symmetry of the Bravais lattices and the symmetry of 
directions in the crystal, we can, finally, go on to consider the complete actual 
symmetry of crystal lattices. This symmetry may be termed microscopic, in 
contradistinction to the macroscopic symmetry of crystals discussed in§ 131. 
The microscopic symmetry determines those properties of a crystal which 
depend on the arrangement of the atoms in its lattice (e.g. the scattering of 
X-rays by the crystal). 

The set of (actual) symmetry elements of the crystal lattice is called its 
space group. The lattice always has a certain translational symmetry, and may 
also have simple, rotary-reflection and screw axes of symmetry and simple 
and glide-reflection planes of symmetry. The translational symmetry of the 
lattice is entirely determined by its Bravais lattice, since by the definition of 
the latter the crystal lattice can have no translational periods except those of 
its Bravais lattice. Hence, to determine the space group of a crystal, it is 
sufficient to find the Bravais lattice and to enumerate the symmetry elements 
which involve rotations and reflections, including of course the relative 
position of these axes and planes of symmetry. It must also be remembered 
that the translational symmetry of the crystal lattice means that, if the lattice 
possesses an axis or plane of symmetry, there exists an infinity of parallel 
axes or planes which are carried into one another by displacements through 
the lattice vectors. Finally, in addition to these axes (or planes) of symmetry 
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separated by lattice vectors, the simultaneous presence of translational sym- 
metry and the axes (or planes) of symmetry results in the existence of other 
axes (or planes) which cannot be made to coincide with the former by a trans- 
lation through any lattice vector. For example, the presence of a plane of 
symmetry involves not only planes parallel to it at distances equal to the lat- 
tice vector but also planes of symmetry which bisect each lattice vector: it 
is easily seen that reflection in any plane followed by translation through a 
distance d in a direction perpendicular to the plane is equivalent to reflection 
in a plane parallel to the first and at a distance $d from it. 

The possible space groups can be divided among the crystal classes, each 
space group being assigned to the class where the set of axes and planes of 
symmetry is the same as in the space group when no distinction is made in the 
latter between simple and screw axes and between simple and glide planes. 
Altogether 230 different space groups' are possible; they were first found by 
E. S. Fedorov (1895). The space groups are distributed among classes as 
shown in Table !. 


Table I 
Class Number of groups Class Number of groups 
Cc, 1 Se 2 
C; 1 Crp 6 
Cc. 4 D; 7 
C, 3 D3, 6 
C;, 6 Cu, 1 
Ci, 22 C, 6 
D, 9 Cu 2 
D,, 28 D3, 4 
S, 2 Cu 4 
C, 6 D, 6 
Cu, 6 Ds; 4 
Day 12 T 5 
Ci 12 T, 7 
D, 10 T, 6 
Du, 20 O 8 
C; 4 O, 10 


t Including eleven pairs that are the same except for the direction of rotation 
about their screw axes. 
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We shall not pause here to enumerate the symmetry elements of all the 
space groups, which would be a very lengthy process. They may be found in 
manuals of crystallography.* 

The space groups that contain no screw axes and glide planes are said to be 
symmorphic; there are 73 of these. The remaining 157 space groups contain 
such symmetry elements. The crystal lattices that belong to non-symmorphic 
space groups must certainly contain at least two identical atoms in each unit 
cell: since a rotation about a screw axis or a reflection in a glide plane 
involves a translation through a fraction of a basic lattice vector, the Bravais 
lattice points are not carried into one another by such transformations, 
and the crystal lattice must therefore consist of at least two interpenetrating 
Bravais lattices occupied by identical atoms. 


§ 133. The reciprocal lattice 


All physical quantities which describe the properties of a crystal lattice have 
the same periodicity as the lattice itself. Such quantities are, for example, 
the charge density due to the electrons in the atoms in the lattice, the proba- 
bility of finding an atom at a particular point in the lattice, and so on. 

Let the function U(r) be any such quantity.Its periodicity implies that 


U(r +7181 + node +n3a3) = U(r) (133.1) 


for any integral 7, ne, nz (a1, 22, ag being the basic vectors of the lattice). 
Let us expand the periodic function U(r) as a triple Fourier series, which 


may be written 
U= 2 U,e’’, (133.2) 


where the summation is over all possible values of the vector b. These are 
determined from the requirement that the function U, when put in the form 
of the series (133.2), satisfies the periodicity condition (133.1). This means that 
the exponential factors must be left unchanged when r is replaced by r-+a, a 
being any lattice vector. For this to be so it is necessary that the scalar pro- 
duct a+b should always be an integral multiple of 27. Taking a successively 
as the basic vectors a1, 2, ag, we must therefore have aisb = 27p1, ageb = 
2nxp2, aseb = 2xps, where pi, P2, Ps are positive or negative integers or zero. 
The solution of these three equations has the form 


b = pibi+pobo+psbs, (133.3) 


t A full account of the space groups is given, for example, by G.Yu. Lyubarskil, 
The Application of Group Theory in Physics, Pergamon, Oxford, 1960, and in the 
International Tables for X-Ray Crystallography, Vol. 1, Kynoch Press, Birming- 
ham, 1952. The latter also lists the equivalent points for each space group. 
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where the vectors b, are given in terms of the a, by 


by = 27a2Xa3/v, be = 27a3Xai/v, bs = 27a1Xao/v, 
v= @1°82XAasz. (133.4) 


We have thus determined the possible values of the vector b. The summation 
in (133.2) is taken over all integral values of p1, Do, Ds. 

Geometrically, the product v = aj*a2Xag represents the volume of the 
parallelepiped formed by the vectors aj, a2, ag, i.e. the volume of the unit cell; 
the products a1 Xa, etc., represent the areas of the three faces of this cell. 
The vectors b, therefore have the dimensions of reciprocal length, and in 
magnitude are equal to 27 times the reciprocal altitudes of the parallelepiped 
formed by the vectors a1, a, as. 

From (133.4) it is seen that b, and a, are related by 


arb, =0 if ixk, 
=2n if i=k. (133.5) 


Hence the vector b, is perpendicular to ag and ag, and similarly for bz and bs. 
Having defined the vectors b,, we can formally construct a lattice with 
bi, be, bs as basic vectors. This is called the reciprocal lattice, and the vectors 
bi, be, bs are called the (basic) vectors of the reciprocal lattice. 
Let us calculate the volume of the unit cell of the reciprocal lattice. This 
is v’ = byebgXbs. Substitution of the expressions (133.4) gives 


v= Cay is “3 @2 X A30(A3 X a1) X (a1 X a2) 


il 


(aoX @3°81) (as X @1°A>) 
= oti v. (133.6) 


It is evident that the reciprocal lattice cell corresponding to a triclinic 
Bravais lattice will also be an arbitrary parallelepiped. Similarly, the recipro- 
cal lattices of the simple Bravais lattices of the other systems are also simple 
lattices of the same system; for example, the reciprocal lattice of a simple 
cubic Bravais lattice also has a simple cubic cell. It is also easy to see by a 
straightforward construction that the reciprocal lattices of the face-centred 
Bravais lattices (orthorhombic, tetragonal and cubic) are body-centred 
lattices of the corresponding systems; the volume of the Bravais parallelepi- 
ped in the reciprocal lattice is v, = (2m) 8/v,, where v, is the volume of the 
Bravais parallelepiped in the original lattice. Conversely, the body-centred 


t The definition (133.4), which is now customary in the physics literature, differs 
by factors of 27 from that usual in pure crystallography. 
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original lattices correspond to face-centred reciprocal lattices, again with 
vy = (225)8/v,. Lastly, base-centred lattices have reciprocal lattices which are 
also base-centred, with v, = (22)? 4/u,. 

An equation of the form ber = constant, where b is a given vector, repre- 
sents a plane perpendicular to the vector b and at a distance from the origin 
equal to the constant divided by b. Let us take the origin at any of the 
Bravais lattice points, and let b = pib1+peb2+psbs be any vector of the 
reciprocal lattice (p1, pz, p3 being integers). Also writing r in the forma = 
181 +ea2+n3a3, we obtain the equation of a plane: 


bea/2x = nypitnePpetnsps = Mm, (133.7) 


where m is a given constant. If this equation represents a plane containing 
an infinity of Bravais lattice points (called a crystal plane), it must be satisfied 
by a set of integers 711, m2, m3. For this to be so, the constant m must clearly be 
an integer also. For given P1, P2, Ps, when the constant m takes various inte- 
gral values, equation (133.7) successively defines an infinity of crystal planes: 
which are all parallel. A particular family of parallel crystal planes thus de- 
fined corresponds to each reciprocal lattice vector. 

The numbers p1, P2, ps in (133.7) can always be taken as mutually prime, 
i.e. as having no common divisor except unity. If there were such a divisor, 
both sides of the equation could be divided by it, leaving an equation of the 
same form. The numbers p,, Pe, P3 are called the Miller indices of the family 
of crystal planes in question and are written as (p1p2Ps). 

The plane (133.7) intersects the coordinate axes (taken along the basic 
lattice vectors a1, 22, a3) at the points ma,/pi, ma2/p2, Mas/ps. The ratio of 
the intercepts (measured in units of a1, a2, a3 respectively) is 1/p1: 1/po: 1/ps, 
i.e. they are in inverse proportion to the Miller indices. For instance, the 
. Miller indices of planes parallel to the coordinate planes (i.e. having inter- 
cepts in the ratio oo : ©: 1) are (100), (010), (001) for the three coordinate 
planes respectively. Planes parallel to the diagonal plane of the basic paral- 
lelepiped of the lattice have indices (111), and so on. 

It is easy to find the distance between two successive planes of the same 
family. The distance of the plane (133.7) from the origin is 2xm/b, where b is 
the length of the reciprocal lattice vector concerned. The distance of the next 
plane from the origin is 27(m-+1)/b, and the distance d between these two 
planes is 


d= 2x/b. (133.8) 
The following formula is useful in applications: 


heb = vy dra), (133.9) 


28 
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where the summations on the left and right are taken over all vectors of the 
reciprocal and original lattices respectively. The sum on the right is a function 
of r that is periodic in the original lattice, and the left-hand side expresses it 
as a Fourier series.t The corresponding formula 


eles = of Yk —b) (133.10) 


follows immediately from (133.9), because of the symmetrical relationship 
between the two lattices. 


§ 134. Irreducible representations of space groups 


The physical applications of the theory of symmetry generally involve using 
the mathematical formalism of what are called representations of groups. In 
this section we shall discuss the question of the classification and method of 
constructing the irreducible representations of the space groups.? 

Let us first summarise in more mathematical terms the information given 
in the preceding sections about the structure of space groups. 

Each space group contains a sub-group of translations comprising an infin- 
ity of all possible parallel displacements which leave the crystal lattice 
unchanged; this sub-group is the mathematical expression of the Bravais 
lattice of the crystal. The complete space group is obtained from this sub- 
group by adding n symmetry elements involving rotations and reflections, 
where nis the number of symmetry transformations of the corresponding 
crystal class; we shall call these the rotational elements. Every element of the 
space group may be represented as the product of one of the translations 
and one of the rotational elements. 

If the space group does not contain screw axes and glide planes (a 
symmorphic group), the rotational elements can be taken simply as the n 
symmetry transformations (rotations and reflections) of the crystal class. In 
non-symmorphic groups, the rotational elements are rotations and reflections 
combined with a simultaneous translation through a certain fraction of one 
of the basic vectors of the lattice. In such space groups, the rotational ele- 


t The divergence of the sum when r = a occurs because the lattice is infinite. 
In considering a lattice of large but finite volume, the sum when r = ais to be 
equated to the number of lattice cells N. 

t The reader is assumed familiar with group theory to the extent given, for 
example, in Quantum Mechanics, Chapter XII. 

8 The translation sub-group is Abelian (all its elements commute) and is a nor- 
mal divisor of the complete space group: all elements of the group that are conju- 
gate to translations are also translations. (Two elements A and B are said to be 
conjugate if A = C71BC with C a third element of the group.) 
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ments of symmetry are “interlinked” with translations and do not themselves 
form a sub-group; for example, a repeated reflection in a glide plane is not an 
identical transformation but a translation through one of the basic vectors 
of the lattice. 

In order to characterise explicitly the elements of the space group, it is 
convenient to denote them by symbols (P| t), where P is any rotation or 
reflection and t the vector of a simultaneous translation; the effect of this 
element on the radius vector r of any point is shown by (P| t)r = Pr+t. The 
multiplication of elements follows the obvious rule 


(P’|t) (P|) = (PP! P’t+0t). (134.1) 
The element inverse to (P| t) is 
(P|t)-2 = (P-!| —P-4); (134.2) 


on multiplication by (P| t) it gives the unit element (E| 0) of the group, where 
E denotes the identical rotation. 

In particular, the pure translations are represented by the symbol (E| a), 
where a is any of the lattice vectors. The rotational elements in symmorphic 
groups, chosen as described above, are of the form (P| 0). In non-symmorphic 
groups, the rotational elements have the form (P| 7), where x is the fraction 
of a lattice vector through which translation takes place along the screw 
axis or glide plane. In the former case, the set of rotational transformations 
(P| 0) is itself a sub-group of the space group. In the latter case, the elements 
(P| 7) themselves do not form a sub-group, since a repeated application of 
them gives not the identical transformation but a translation through one of 
the basic lattice vectors. The rotations and reflections P as such (ie. if 
the differences between simple and screw axes, and between simple and glide 
planes, are ignored) always form a group, however, namely the point sym- 
metry group which defines the crystal class; this point group may conven- 
iently be called in this respect the direction group of the lattice." 

Let us now construct the irreducible representations of the space groups. 

Any such representation can be given by a set of functions of the form 


Pra = Uwe’ , (134.3) 


t In every case, the relation between the space group and the direction group may 
be formulated in the following way, using the group-theory standpoint. Let all 
the elements of the space group be distributed into n affine classes, each containing 
an infinity of products of one rotational element with all possible translations, 
i.e. all elements of the form (P| ¢+a) with given P and r. If we now regard the whole 
of each of these classes as an element in a new group, we get a factor group of the 
original space group, which is isomorphous with the direction group. 


t The arguments below are due to F. Seitz (1936). 


28* 
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where the k are constant wave vectors, the u,,, are functions invariant under 
translations, and the suffix « = 1, 2, ... labels functions with the same k. 

As a result of a parallel displacement r - r-+a (where a is any vector of 
the lattice), the functions (134.3) are multiplied by constants e***. In other 
words, the matrices of translations are diagonal in the representation given by 
the functions (134.3). It is evident that two vectors k which differ by b 
(where b is any vector of the reciprocal lattice) will give the same law of 
transformation of the functions ¢,, by translations: since acb is an integer 
times 27, e*» = 1. Such vectors k will be said to be equivalent. If we imagine 
the vectors k drawn from a vertex of a reciprocal lattice cell to various points, 
the non-equivalent vectors will correspond to the points in one unit cell. 

By the application of a rotational element of symmetry (P| 7), the function 
¢,,., is transformed into a linear combination of the functions ¢,,, with vari- 
ous values of « and a vector k’ which is obtained from k by means of the 
rotation or reflection in question, performed in the reciprocal lattice: 
k’ = Pk. The set of all (non-equivalent) vectors k which can be obtained 
from one another by the application of all n rotational elements of the group 
is called the star of the wave vector k. In the general case of arbitrary k the 
star contains n vectors (rays). The functions ¢,, which form the basis of an 
irreducible representation must always include functions having all the 
different rays of the star of k: since functions with non-equivalent k are 
multiplied by different factors under translations, no choice of linear combi- 
nations of them can bring about a decrease in the number of functions which 
are transformed into combinations of one another. 

For certain values of k the number of rays in its star may be less than n, 
since it may happen that some of the rotational elements of symmetry leave k 
unchanged or transform it into an equivalent vector. For example, if the vec- 
tor k is along an axis of symmetry, it is unchanged by rotations about this 
axis; a vector k from a vertex to the centre of the unit cell, i.e. k = <b, 
where b, is one of the basic vectors of the reciprocal lattice, is transformed 
by inversion into the equivalent vector —k = —+b, = k—b,. 

The set of rotational elements of symmetry (regarded as all being simple 
rotations or reflections P) which appear in a given space group and which do 
not alter the vector k (or which transform it into an equivalent vector) is 
called the proper symmetry group of the vector k, or simply the group of k; it 
is one of the ordinary point symmetry groups. 

Let us first consider the simple case of symmorphic space groups. The 
base functions of an irreducible representation of such a group can be written 


t In transforming the vector k in the reciprocal lattice all axes and planes of 
symmetry must, of course, be treated as simple ones, i.e. only the direction 
group is to be considered. 
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as products 
Pka = Us (134.4) 


where the u, are invariant under translations and the y, are linear combina- 
tions of the expressions e"**' (with equivalent k) invariant with respect to the 
transformations in the proper symmetry group of the vector k; in (134.4) 
this vector takes all the values in its star. In translations the functions u, are 
unchanged, but the functions y,, and therefore the ¢,,,. are multiplied by e***. 
In rotations and reflections belonging to the group of k, the functions y, are 
unchanged but the functions u, are transformed into combinations of one 
another. Thus the functions u, give one of the irreducible representations of 
the point group, these being called in this connection small representations. 
Finally, rotational elements which are not in the group of k transform 
sets of functions (134.4) with non-equivalent k into combinations of one 
another. The dimension of the representation of the space group thus con- 
structed is equal to the number of rays in the star of k multiplied by the 
dimension of the small representation. 

Thus the problem of finding all irreducible representations of symmorphic 
space groups reduces entirely to the classification of the vectors k with re- 
spect to their proper symmetry and the known problem of discovering the 
irreducible representations of finite point groups. 

Let us now consider space groups which have screw axes or glide planes. 
The presence of such elements of symmetry is still unimportant if the wave 
vector k is such that it remains unchanged (i.e. is not transformed into an 
equivalent vector) under all the transformations in its group." In this case the 
corresponding irreducible representations are again given by functions of the 
form (134.4), in which the u, form the basis of a representation of the point 
group of the vector k. The only difference from the previous case will be 
that under rotational transformations the functions y, = e*** in (134.4) will 
not remain unchanged, but will be multiplied by e***. 

Functions of the form (134.4) become inapplicable, however, if there are 
several equivalent vectors k transformed into one another by transformations 
in their proper symmetry group. In a rotational transformation with a 
simultaneous translation +, functions e*' with equivalent but different 
values of k are multiplied by different factors (since b-x/2x is not integral), 
and therefore their linear combinations y, will not be transformed into 
combinations of one another. 

In such cases it is no longer possible to consider the rotational elements and 
the translations separately, but of the infinity of translations it is sufficient 
to consider a finite number only. These cases arise for vectors k from a 


t This always includes, in particular, the vector k = 0 and a vector in a general 
position in which the unit element of its group is the identical transformation. 
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vertex of a unit cell of the reciprocal lattice to various special points within 
the cell, all or some of whose coordinates are simple rational fractions of the 
basic vectors b;, be, bs.‘ We shall call the extended group of the wave vector 
the group consisting of the rotational element (together with the relevant 
translations through fractions of a period t) and those translations for 
which k-a/2z is a rational fraction (<1); the remaining translations 
are again regarded as identical transformations. The functions @¢,,, which 
give irreducible representations of the finite group thus formed (small 
representations), together with the corresponding functions ¢,,, for other 
rays in the star of k, give an irreducible representation of the space group. 
The dimension of the small representations in these groups may be as high 
as six (for the crystal class O,).t 

This procedure will now be demonstrated for a specific example. 

Let us consider the space group (D3,) which corresponds to the simple 
orthorhombic Bravais lattice and contains the following rotational elements: * 


(E/0), (Cz!0), (C210), (3/9), 
(Iz), (ox! 4), (oy!4), (a2!) 


the x, y and z axes being taken along the three basic vectors of the lattice; 
t = 4+(a;+a2 +83); the axes of symmetry C2 are simple axes but the planes o 
perpendicular to them are glide planes. 


t In practice these fractions are usually 4, 4 or 2, the last two values occur- 
ring for groups of the rhombohedral and hexagonal systems. 


+ If the representations of the extended group of the wave vector are regarded 
as representations of the unextended group (one of the point groups), then the 
relations between the matrices G that represent the elements G of the group will be 
different from those between these elements themselves: if G,|G, = G3, the cor- 
responding matrices of the representation will in general be related not by a simi- 
lar equation G,G, = Gs (as in ordinary representations), but by G,G_ = @.G3, 
where @,, is a phase factor equal to unity only in modulus: |@,.| = 1. Such rep- 
resentations are said to be projective. All the essentially different projective re- 
presentations can belisted once and for all in each point group, and thereafter used 
as small representations for constructing irreducible representations of space 
groups. 

An account of the theory of projective representations, and tables of them for the 
prystallographic point groups, will be found in G. L. Bir andG. E. Pikus, Symmetry 
and Deformation Effects in Semiconductors (Simmetriya i deformatsionnye éffekty v 
voluprovodnikakh), Nauka, Moscow, 1972. 

There exist also complete tables of irreducible representations of space groups, 
which are given by O. V. Kovalev, Irreducible Representations of the Space Groups, 
Gordon & Breach, New York, 1965;C. J. Bradley and A. P. Cracknell, The Mathe- 
matical Theory of Symmetry in Solids, Clarendon Press, Oxford, 1972. 

§ The space groups are customarily denoted by the symbol! for the crystal class, 
with an index which arbitrarily numbers the group within the class. 
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Let us take, for example, the vector 
k = (, 0, 0); (134.5) 


the three numbers in parentheses give the components of the vector along the 
reciprocal lattice axes, measured in units of the edge lengths (b, = 27/a,) of 
the reciprocal lattice cell. Its proper symmetry includes all the axes and 
planes of the point group D,,, and so this vector isits own star. The extended 
group is got by adding the translation (E| a;) for which k-a/2x = 1. Thus we 
obtain a group of 16 elements in 10 classes as shown in the upper line in 
Table 2. The fact that the elements (C3|0) and (C3|a,), for example, are 
conjugate (i.e. belong to the same class) may be seen as follows. We have 


(1| r)~* (C30) Z| z) = G| —7) (C410) 2) 
= (| —%) (C3 | Cir) 
= (C3| —7 + Cr). 

But 
Cir = 3(—a,+a2— a5), 
—t+Cir = —a,—a, 
a,—(2a;+ as), 


and, since translations through a3 and 2a; must be regarded as identical 
transformations, we have 


(T| z)~* (CZ 0) Z| z) = (Ch a). 
Table 2 


(CZ10) (C310) Wit) = (o,|t) (ie) a!) 
(E| 0) (Z| a,) (Cz! 0) (Ci}a,) 

(Cyl a,) (CZ/ a) Z| +8,) (6,! e+8,) (oy|e +21) (0,! F +21) 
2. 29 2- “29° 6 0 0 0 0 0 
a 22 29 2 0 0 0 0 0 0 


From the numbers of elements and classes in the group we find that it has 
eight one-dimensional and two two-dimensional irreducible representations 
(8-1242-22 = 16). All the one-dimensional representations are obtained 
from representations of the point group D,,, the translation (E|a,) being 
assigned the character 1. These representations, however, occur here as 
“spurious” representations and must be rejected. They do not solve the 
problem in question: their base functions are invariant under all translations, 
whereas the functions e**" with given k are certainly not invariant with 
respect to the translation (E| a1). Thus there remain only two irreducible 
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representations, whose characters are shown in Table 2. The base functions 
of these representations can be taken as 


Iy:cosxx, sin2x, 
Ig: cos nx sin 2xy, sin zx sin 2ny; 


the coordinates x, y, z are measured in units of the respective basic vectors 
31, A2, 23. 
Let us also consider the representations corresponding to the star of two 


vectors 
k = (4,0,x), (4,0, —x) (134.6) 


with proper symmetry C,,, (the axis C, being along the z-axis); x is an arbi- 
trary number between 0 and 1 (other than 4). The extended group of k con- 
tains eight elements in five classes (Table 3). (The dependence of the base 
functions of the representations of this group on z reduces to a common 
factor e?™ or e7?™*, which is invariant under all transformations of the 
group; it is therefore unnecessary to extend the group by translations along 
the z-axis.) There are four one-dimensional and one two-dimensional 
irreducible representation of this group. The one-dimensional representations 
must be rejected for the same reason as previously, leaving only one 
representation, whose characters are shown in Table 3. Its base functions 


can be taken as 
ettnixz cosmx, etl sin rx, 


with the plus and minus signs in the exponent for the two vectors (134.6) 
respectively; the complete irreducible representation of the whole space group 
is four-dimensional, and is given by all four of these functions together. 


Table 3 
(C310) (ox1 4) (a, | 7) 
(E|0) (E/a;) 
(C3)a)) (0,|t+a;) (o,| 7+) 


2 —2 0 0 0 


§ 135. Symmetry under time reversal 


In physical applications of the theory of symmetry groups, a further condi- 
tion is usually imposed on their representations, namely that the base func- 
tions should be real (or rather reducible to a real form). This arises in con- 
sequence of the symmetry under time reversal. In quantum mechanics, this 
symmetry has the result that complex-conjugate wave functions must cor- 
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respond to the same energy level of the quantum system, and must therefore 
appear among the base functions of the same physically irreducible represen- 
tation (cf. Quantum Mechanics, § 96). In classical theory, however, this. 
symmetry is expressed by the invariance of the equations of motion under 
the change t + —?¢; the equation contains time derivatives of an even order, 
the second. Consequently, the equations for the displacements u, of the 
atoms in the lattice remain real when their solution is sought in the complex 
form (69.6) (oc e~‘"); the amplitudes of these expressions can therefore be 
taken to be real.* 

Real base functions, of course, remain real under the action of any element 
of symmetry; thus all the group representation matrices are real. If, however, 
any irreducible representation does not satisfy the condition, it must be 
combined with the complex-conjugate representation to form one physically 
irreducible representation of twice the dimension. Let us consider from this 
standpoint the cases that can occur for representations of space groups 
(C. Herring, 1937). 

The simplest case in this respect is the one where the stars of the wave 
vectors k and —k do not coincide. Then the irreducible representations con- 
structed from each star are certainly complex. For example, for the star of k 
the base functions of the representations are multiplied by factors e*** under 
translations (£|a), and these factors include none that are complex con- 
jugates; it is therefore clear that no choice of linear combinations of these 
functions can bring the matrices of the transformations to a real form. 
On the other hand, by taking the complex conjugates of these functions we 
obtain the complex-conjugate representation belonging to the star of the 
vector —k. By combining these two representations, we arrive at a real 
representation. Thus, to find a physically irreducible representation, we must 
include —k together with k in the star of each wave vector. That is, to obtain 
the whole of the required star we must apply to some original k all the ele- 
ments of the direction group plus the centre of symmetry. 

If, however, the star of the wave vector already contains all the necessary 
values of k, this does not guarantee that the irreducible representations 
constructed from them are real, as can be shown by a simple example. 

Let us consider the symmorphic space group Sj, belonging to the crystal 
class S, and having a simple tetragonal Bravais lattice. In this group, we 
take the representations corresponding to the star of two vectors 


k = (0,0,%), (0,0, —), (135.1) 


where the z-axis is along the symmetry axis S4, and is an arbitrary number 
between 0 and 1! ( other than 4). The proper symmetry of these vectors is C2, 


t This will not be so in the presence of a magnetic field or in a crystal having a 
magnetic structure. 
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a point group having two one-dimensional representations with the following 
characters: 


E Cy, 
A 1 1 
B 1-t 


Taking the first of these as the small representation, we get a two-dimensional 
representation of the whole space group, whose basis can be taken as the 
complex conjugate functions exp (+ 22ixz); this representation is therefore 
real. When B is the small representation, we have a two-dimensional repre- 
sentation of the whole group, with base functions 


exp (27ixz) cos 2nx, exp (—2ixz) sin 2nx. 


The characters of the rotational elements of the group in this representation 
are 
(E|0) (S,{0) (C,{0) (S3] 0) 
2 0 —2 0 


and those of the translations are 


(Ela1) (Ejaz) (E/as) 
2 2 2 cos 27x 


All these characters are real, but the representation is nevertheless complex: 
its base functions cannot be brought into a real form. The physically irre- 
ducible representation is found by adding to these functions their complex 
conjugates, and in this case therefore by combining two complex-conjugate 
but equivalent (with the same characters) representations. 

In the example considered, the symmetry under time reversal doubles the 
dimension of the physically irreducible representation for values of the wave 
vector on a straight line (the axis of symmetry) in k-space. There are also 
cases where this doubling occurs for values of k occupying a whole plane in 
k-space, this plane being perpendicular to a second-order screw axis. 

Let us consider, for example, the non-symmorphic space group C3, which 
belongs to the crystal class C, and has a simple monoclinic Bravais lattice. 
The second-order axis, which we take as the z-axis, is here a screw axis, 
with a translation through half a lattice vector: (C2|4as). In this group, 
we take the star of two wave vectors: 


k= (x,1,4), (—*, —A, +), (135.2) 


* This kind of situation did not arise in point groups, for which all the irredu- 
cible representations with real characters are real. 
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where x and / are arbitrary numbers between 0 and 4; the x- and y-axes are 
oblique, in a plane perpendicular to the axis of symmetry. The star includes 
k and —k, since the vectors (—x, —4, —}) and (—x, —A, 4) are equivalent. 
This star corresponds to two equivalent (with the same real characters) 
two-dimensional irreducible representations of the group, given respectively 
by the base functions 


and their complex conjugates. A physically irreducible representation is 
obtained by combining these two complex-conjugate representations. Its 
four base functions fall into two pairs each corresponding to one of the two 
wave vectors in the star: 


gaalextieting and  e-2nllex+detinz , 


If the irreducible representation is found together with its base functions, 
there is no difficulty in seeing whether it is real or complex. Nevertheless, 
in more complicated cases (and for the investigation of certain general 
problems) it is useful to have a criterion for answering this question from the 
characters of the small representation. Such a criterion can be obtained from 
the following general theorem in the theory of group representations.’ 

For each of the irreducible representations of the group, the following 
sum can have one of three values: 


=) (G2) = +1(a), 0(b), —1(0). (135.3) 


The summation is over all the elements in the group, and g is the order of 
the group. Correspondingly the representation is (a) real, (b) complex with 
non-equivalent complex-conjugate representations (they have complex- 
conjugate characters), (c) complex with equivalent complex-conjugate repre- 
sentations (they have equal real characters). 

We shall outline the way in which this criterion can be converted for 
application to space groups, but not give the details. According to the method 
described in § 134 for the construction of irreducible representations of 
space groups, their characters may be expressed as 


xi(P| ¢+a)] = 2, tu (P| 7)} exp (ikiea), (135.4) 


where y,[(P|k)] are the characters of the rotational elements of the group in 
the small representation, and the summation is over those rays k1, ke, ... of 


t It is proved, for instance, in the books mentioned in the footnotes to §§ 132 
and 134. 
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the star of the wave vector for which P is an element of the symmetry group. 
Applying this formula to the element 


(P\|t+a)? = (P?\7+Pr+a+Pa) 
= (P| 7) (FE! a+ Pa), 


we have 
y(P | t+a)"] = 2 x,{(P| 7)*Jea- et Pho, 


with the substitution k,-Pa = a-P~1k, in the exponent. These characters 
are to be summed over all translations and all rotational elements (P| 7). 
The sum 


y) exp {ia-(k;+ P~1k;)}} 


is zero except when k, +P~1k, = 0 or b. Lastly, because all rays in the star 
are of equal significance, in the sum over 7 which is finally to be calculated 
all the terms are the same. 

We thus obtain the Herring criterion: 


5 EnllP |e) = +1 (a), 0(b), —1(0), (135.5) 


where y,, are the characters of the small representation and the summation is 
over those rotational elements (P| 7) of the space group which convert k 
into a vector equivalent to —k: Pk = —k+b;' n, is the number of rotational 
elements in the proper symmetry of the wave vector. 

In particular, if the space group contains no rotational elements having 
the specified property, the sum (135.5) has no terms, and case (b) occurs, 
in accordance with our previous discussion of the case where the stars of k 
and —k are not the same. 

In the example of the group Sj discussed above, the elements (S,|0) 
and (S3| 0) have the desired property; their squares are the element (C,,|0). 
The sum (135.5) is therefore 


Fre (Ss | OF) + xul(S4! 0)°T} = wul(C2! 0]; 


it is 1 for the small representation A and —1 for the small representation B, 
for which we thus have cases (a) and (c), again in accordance with previous 
results. 


t Here (P| x)” does not alter the vector k (or converts it into an equivalent vector), 
and is therefore certainly part of the proper symmetry group of the vector k. 


§ 136 Symmetry Properties of Normal Vibrations of a Crystal Lattice 427 


§ 136. Symmetry properties of normal vibrations of a crystal lattice 


One physical application of the mathematical theory of the representations 
of space groups is the classification of the normal vibrations of a lattice by 
means of their symmetry properties.’ 

It may be recalled that a lattice with v atoms in the unit cell has for any 
given wave vector k a total of 3y normal vibrations, each with its frequency 
«(k). That is, over the whole range of variation of k the dispersion relation 
@ = w(k) of the vibrations has 3y branches w,(k); each of the w,(k) takes 
values in a certain finite range, the phonon energy band. All the essentially 
different values of the wave vector lie in one unit cell of the reciprocal lattice; 
if we consider the whole infinite reciprocal lattice, the functions w,(k) are 
periodic in it: 

@a(k +b) = w.(k). (136.1) 


The physical principles of the classification of lattice vibrations by means 
of the irreducible representations of the symmetry group are the same as for 
the corresponding classification of finite symmetrical systems (polyatomic 
molecules; see Quantum Mechanics, § 100). The normal coordinates of the 
vibrations, which act as the basis of an irreducible representation of the 
symmetry group of the lattice, all belong to one frequency. 

Each irreducible representation of the space group is specified, first of all, 
by its star of wave vectors. From this it follows immediately that the fre- 
quency is the same for all normal vibrations that differ only by values of k 
from the same star. Thus each function w,(k) has the complete symmetry of 
directions for the crystal class concerned. As mentioned in § 135, the sym- 
metry under time reversal means that the star of k must be supplemented 
by all vectors —k (if k and —k themselves do not coincide); thus we always 
havet 

a(—k) = o,(k). (136.2) 


For a given value of k (i.e. for one ray of the star), the normal coordinates 
are distributed among the bases of the small representations corresponding 
to various frequencies. If the dimension f of the small representation is more 
than one, there is degeneracy for the given value of k, the frequency in f 
branches being the same. 


+ Representations of space groups were first applied to the study of the physical 
properties of crystal lattices by F. Hund (1936) and by L. P. Bouckaert, R. Smolu- 
chowski and E.P. Wigner (1936). 


t From the physical point of view, the relation between the transformation 
k + —k for the lattice vibrations and time reversal is obvious: a change in the 
sign of the time reverses the direction of wave propagation (or, in terms of the 
phonon picture, changes the sign of the phonon momentum p = Hk). 
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When the vector k has a general position in the reciprocal Jattice, it has 
no proper symmetry (its group contains only the unit element, Le. the iden- 
tical transformation); all 3v values of w,(k) are in general different. Degener- 
acy may occur if the proper symmetry of the wave vector is so high that its 
group has irreducible representations with dimension f > 1. With reference 
to the spatial symmetry only, this may take place either at isolated points 
in the lattice or on entire straight lines (axes of symmetry) in it. The symmetry 
under time reversal may also cause (twofold) degeneracy on entire planes 
in k-space (F. Hund, 1936; C. Herring, 1937). According to the discussion in 
§ 135, this degeneracy can occur on planes perpendicular to a second-order 
screw axis; see the example of representations associated with the star 
(135.2).7 

In order to classify the normal vibrations of a specific crystal lattice, we 
must first find the complete vibrational representation of the space group 
given by all the vibrational coordinates (atomic displacement vectors) 
together. This representation is reducible; by decomposing it into irreducible 
parts, we find the degree of degeneracy of the frequencies and the symmetry 
properties of the corresponding vibrations. It may happen that one represen- 
tation appears several times in the vibrational representation: this means 
that there are several different frequencies with the same multiplicity corre- 
sponding to vibrations of the same symmetry. 

This procedure is analogous to the method of classifying molecular vi- 
brations (Quantum Mechanics, § 100). There is an important difference, how- 
ever, in that the lattice vibrations are further characterised by the parameter 
k, which takes a continuous series of values, and the classification must be 
carried out for each value (or class of values) of the wave vector separately. 
If k is specified, the star of the irreducible representation of the space group 
is determined. It is therefore sufficient in practice to find the vibrational 
small representation and decompose it into irreducible small! representations, 
ie. irreducible representations of the symmetry group of the vector k. 

The classification of the lattice vibrations in the limit as k ~ 0 is especially 
simple. When k = 0, the irreducible small representations for all the space 
groups (whether or not symmorphic) are the same as the irreducible rep- 
resentations of the point symmetry group of the lattice, namely its crystal 
class. To find the vibrational representation (D,;,) we must consider only 


t Besides degeneracies due to the lattice symmetry, there can also be degeneracy 
for “accidental” values of k; the existence of such degeneracies could be theoreti- 
cally predicted only by actually solving the equations of motion of the atoms in a 
specific lattice. For an analysis of possible cases, see C. Herring, Physical Review 
§2, 365, 1937 (reprinted in R.S. Knox and A. Gold, Symmetry in the Solid State, 
Benjamin, New York, 1964), 
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the atoms in one unit cell, i.e. regard all translationally equivalent atoms? as 
one. Without repeating the whole discussion of the atomic vibrations in the 
molecule for this case, we can formulate the following rule for finding the 
characters of the vibrational representation of the lattice when k = 0. The 
characters of a rotation C(¢) through an angle ¢ about an axis of symmetry, 
or S(#) about a rotary-reflection axis, are 


Avi0(C) = re%(C), Yviv(S) = vsxo(S), (136.3) 


where 
A(C) = 1+2 cos @, LAS) =—14+2cos ¢ 


are the characters of the representation given by the three components of a 
(polar) vector; »¢ and v, are the numbers of atoms which remain fixed or 
move to translationally equivalent places in the transformation.! The same 
formulae give the characters for reflection in a plane (transformation S(0)) 
and for inversion in a centre of symmetry (transformation S(z)). A rotation 
about a screw axis or a reflection in a glide plane certainly brings all the 
atoms into translationally non-equivalent positions, and for them we there- 
fore always have y,;, = 0. 

These rules may be illustrated by an example.’ The diamond lattice 
belongs to the non-symmorphic space group O%. It has a face-centred cubic 
Bravais lattice with two like atoms in the unit cell at the vertices (000) and 
at the points (¢ + +) on the spatial diagonals of the cubic cells.|! Half the rota- 
tional elements of the group O} coincide with rotations and reflections in the 
point group 7,,. These transformations leave both atoms fixed or move them 
to translationally equivalent positions. The characters of the vibrational 
representation for these elements are therefore y,,, = 2y,. The remaining 


t These are the occupied sites in any one Bravais lattice. 

t For a molecule, a subtraction is needed in the characters of the vibrational 
representation in order to eliminate the coordinates that correspond to the displace- 
ment or rotation of the molecule as a whole. For a lattice, the number (six) of 
these degrees of freedom is negligible compared with the total number of degrees 
of freedom, and such a subtraction is not necessary. 

§ To avoid misunderstanding, we may note that the classification of the limit- 
ing frequencies of the optical branches of the vibrations by means of the crystal- 
lographic symmetry only of the lattice is not valid for ionic crystals. The long- 
wavelength optical vibrations of the ionic lattice are accompained by the occurrence 
of macroscopic polarisation of the crystal and an associated macroscopic electric 
field, which in general modifies (and reduces) the symmetry of the vibrations. 

U The coordinates of the atoms are specified with respect to the edges of the cubic 
cell and in units of the length of these edges. The volume of a face-centred cubic 
cell is four times that of the unit cell. The basic lattice vectors go from the vertex 
to the points ( 0), ($ 0 +), (0 $ 4), which are the centres of the faces of the 
cubic cell, 
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rotational elements of the group Oj} are screw rotations and reflections in 
glide planes, obtained by combining the elements of the group 7, with 
inversion (J| v7), where r]= (44 4); these elements move the atom at (000) to 
the translationally non-equivalent point (4 4), so that their characters are 
Yip = 0. The decomposition of the resulting vibrational representation into 
irreducible representations of the point group O, is Dy, = Fo,+F2,." The 
coordinates of the acoustic vibrations which describe the movement of the 
cell as a whole when k = 0 are transformed as the components of a vector, 
and therefore correspond to the representation F,, by which such compo- 
nents are transformed in the group O,. The representation F,, corresponds 
to the triply degenerate limiting frequency of the optical vibrations.t 

The degeneracy of the optical vibrations is generally removed on leaving 
the point k = 0. According to the symmetry, the amount of splitting may 
vary near k = 0 as a homogeneous function of the first or second order in 
the components of the vector k. The appropriate criterion is easily ascer- 
tained in terms of quantum perturbation theory. The Hamiltonian of lattice 
vibrations with a small wave vector k = 6k has the form A, +)-6k, where A, 
is the Hamiltonian of vibrations with k = 0, and } is a vector operator; 
the term -6k acts as the perturbation which causes the splitting. The amount 
of splitting is of the first order in 6k if ? has non-zero matrix elements for 
transitions between states belonging to the same degenerate vibrational 
frequency; otherwise, the splitting is of the second order in ék. Here it must 
be noted that the operator } changes sign under time reversal, since the wave 
vector dk changes sign while the product ?-dk, like any Hamiltonian, must 
be invariant under time reversal. Thus the solution of the problem reduces 
to finding the selection rules for frequency-diagonal matrix elements of a 
vector operator that changes sign under time reversal (see Quantum Mecha- 
nics, § 97). If the degenerate frequency corresponds to an irreducible rep- 
resentation D, these rules depend on the decomposition of the antisymmetric 
part of its direct product with itself, {D}; non-zero matrix elements exist if 
this expansion contains parts by which the components of a vector are 
transformed. 


t The point group O, may be regarded as the direct product OX C; or T7xC;; 
here we use the second form. Accordingly, the irreducible representations of the 
group O, are constructed from representations of the group 7,. In particular, the 
representations F,, and Fy, of O, are obtained from the representation F, of Ty, 
and differ in being respectively even and odd under inversion (see Quantum Mecha- 
nics,§ 95). 

t The limiting frequency of the acoustic vibrations is always degenerate: the 
macroscopic nature of these vibrations leads to a single value w = 0 for all three 
branches, even if this is not required by symmetry. In this sense the degeneracy is 
“accidental”, 
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The splitting will certainly be of the second order in 6k if the point sym- 
metry group of the lattice (the crystal class) contains a centre of inversion; 
this is evident, since the squared basis of the representation {D?} must be 
even under inversion, whereas the components of a vector change sign. If the 
crystal class does not contain an inversion, there are two possibilities. For 
example, in the crystal class O, the antisymmetric products for the two- 
dimensional irreducible representation E and the three-dimensional rep- 
resentations F, and F, aret 


{E}=A, {FR = {Fp = Atk. 


The components of a vector are transformed by F1; hence the splitting of the 
doubly degenerate frequency will be of the second order in 6k, and that of 
the triply degenerate ones will be of the first order. 

Let us now consider vibrations with non-zero wave vector. For symmorphic 
space groups, they are classified in the same way as in the case k = 0 
described above. The irreducible small representations are here the same as 
the irreducible representations of the point symmetry group of the vector k, 
and to find the vibrational small representation we must again consider only 
the atoms in one unit cell. 

The procedure may be demomstrated for the optical vibrations of the dia- 
mond lattice. The face-centred Bravais lattice of this structure corresponds 
to a body-centred cubic reciprocal lattice. At the point k = 0 (the vertex 
of the cubic cell) the proper symmetry of the wave vector is O,, and there is 
(as shown above) one triply degenerate frequency of optical vibrations, 
corresponding to the representation F,,; the characters of this represen- 
tation aret 

E 8Cs 3C2 60’ 6S, I 8S. 30 6Co 6C, 
Fe 3 O -1 1 -1 3 0 -1 1 -1 


Let us now see how this frequency is split when we leave the point k = 0. 

A displacement along the spatial diagonal of the cubic cell gives the vector 
k the proper symmetry C,,. For this group the representation given by the 
same three vibrational coordinates is reducible: 


E 2C3 30° 
3 0 1= E+4A3, 


t The notation for the irreducible representations of point groups is as in Quan- 
tum Mechanics,§ 95. 


t The symmetry elements that appear in the point group 7, are put first, followed 
by the elements obtained on multiplication of these by the inversion J. The ele- 
ments 3C, are rotations through 7 about axes along the edges of the cubic cell; 
6C,, are rotations throughz about the diagonals of the faces; 60” are reflections in 
planes through opposite edges; 30 are reflections in the planes of the faces. 


29 
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i.e. the triply degenerate frequency splits into one doubly degenerate and 
one non-degenerate frequency. A similar splitting occurs in a displacement 
along an edge of the cubic cell, where the proper symmetry of the wave 
vector is C,,: 

E Cy 3C, 20 20° 

3 -1 -—-1 -1 1= E+Be. 
In a displacement along a diagonal of a face of the cubic cell, the proper 
symmetry of the vector k is lowered to C,,, and the frequency splitting is 
complete: 

EC, o¢ @& 

3 1 —-1 l= Ay+ Ao+Be. 


For crystal lattices of non-symmorphic space groups, the procedure for 
classifying the normal vibrations is more laborious, and we shal! not pause 
to discuss it here." 


§ 137. Structures periodic in one and two dimensions 


A characteristic feature of solid crystals is that the density function (x, y, z) 
is periodic in three dimensions and extends to unlimited distances. Let us 
consider the possibility of the existence in Nature of bodies whose density 
functions are periodic in only one or two dimensions (R. E. Peierls, 1934; 
L. D. Landau, 1937). 

For example, a body with o@ = 0(x) could be regarded as consisting of par- 
allel planes regularly arranged and lying perpendicular to the x-axis, with 
the atoms randomly distributed in each plane. When e = 0(x, y), the atoms 
would be randomly distributed along lines parallel to the z-axis, but these 
lines themselves would be regularly arranged. 

To discuss this question, let us consider the displacements undergone by 
small parts of the body as a result of thermal fluctuations. It is clear that, if 
such displacements increase without limit as the size of the body increases, 
there will necessarily be a “smoothing-out” of the function @, in contradic- 
tion with hypothesis. In other words, only those structures can occur for 
which the mean displacement remains finite when the dimensions of the body 
become arbitrarily large. 

Let us first confirm that this condition is satisfied in an ordinary crystal. 
Let u(x, y, z) denote the vector of the fluctuation displacement of a small 
region with coordinates x, y, z and let u be represented as a Fourier series: 


w= 2 mKen* 5 (137.1) 


t Examples of such groups are given in the book by Bir and’ Pikus quoted in 
§ 134. 
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the components of the vector k take both positive and negative values, 
and the coefficients u,, are related by u_,, = uy, Since wis real. The series (137.1) 
will include only terms with not too large wave numbers, k < 1/d, where dis 
the linear dimension of the region undergoing displacement. We shall 
consider the fluctuations at constant temperature; then their probability is 
given by 

w oc exp (—AF,/T), (137.2) 


where 
AF, = | (F—-F) av (137.3) 


is the change in the total free energy of the body in the fluctuation, and F 
now denotes the free energy per unit volume of the body (cf. (116.7)). 

To calculate AF,, we must expand F—F in powers of the displacement. 
The expansion will involve not the function u(x, y, z) itself but only its deriv- 
atives, since F— F must vanish when u = constant, corresponding to a simple 
displacement of the body as a whole. It is evident that the terms in the ex- 
pansion which are linear in these derivatives must be absent, since otherwise 
F could not have a minimum for u = 0. Next, owing to the smallness of the 
wave numbers k, we need go only as far as the terms quadratic in the first 
derivatives of u in the expansion of the free energy, neglecting the terms 
containing the higher-order derivatives. Hence we find that AF, has the 
form 


AF, = VY UnuUi® (Kx, Ks k.), (137.4) 


where the elements of the real tensor ¢, (i and / being the tensor indices, 
over which summation is implied) are quadratic functions of the compo- 
nents of the vector k.* 

According to (111.9), we hence find for the mean square fluctuations of 
the Fourier components of the displacement vector 


(unattic) = (IV 67h (Kas keys Ka), Cindi) = 0 for k’ #—k, (137.5) 


where @~;, are the components of the tensor inverse to ,.¢ For clarity, 
this expression may be written 


(unde) = +7 a (137.6) 


t The terms containing products UnYny exp [i(k-+k’)er] with k’ + —k disappear 
on integration over the volume. 

t To determine the constant numerical coefficient in (137.5), we have to note 
that each product u,,u7, appears twice (for +k) in (137.4), giving 2 re (u,,un), 
and the real part of the product Unln. is itself the sum of two independent pro- 
ducts. 


29¢ 
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where the A,, depend only on the direction of the vector k (n = k/k). The 
mean values (u,u,) are found from (137.6) by summation over k; changing 
in the usual manner from this to integration, we get, for example, the mean 


square displacement vector 
A;(n) d®k 

w 
ay fa (I kk? (2n)> 


=T { Aun) S22 aS ; (137.7) 


The integral converges at the lower limit (k — 0) linearly in k.‘ Thus the 
mean square of the fluctuation displacement is, as it should be, a finite 
quantity independent of the size of the body. 

Next, let us consider a body with density function @ = e(x). Since 9 = 
constant along the y- and z-axes in such a body, no displacement along 
these axes can “smooth out” the density function, and such displacements 
are consequently of no interest here. We need therefore consider only a 
displacement u,. Moreover, it is easy to see that the first derivatives du,/dy, 
du,,/0z cannot appear in the expansion of the free energy, since, if the body is 
rigidly rotated about the y- or z-axis, these derivatives change, whereas the 
freé energy must obviously remain constant. Thus in the expansion of F—F 
we have to consider the following terms quadratic in the displacement: 


ux 2 Ou, O2u, ie Ou, Ou, fa O7u, \’ 

(3) > Ox (s =a) (a oz? }” 
the derivatives with respect to y and z appear symmetrically, owing to the 
complete symmetry in the yz-plane. Substitution in (137.3) leads to terms of 


the types 
| ty | Kes | Ux |? ko", [ sex |2 x, 


where x” = k}+k2. Although the two latter expressions include powers of 
the wave vector components higher than the first expression, they may be 
of the same order of magnitude, since nothing is known a priori concerning the 
relative magnitude of k, and x. 

Thus the change in the free energy will be of the form 


AF, = VF leel? O(n #9, (137.8) 


where # is a quadratic function of the variables k, and x*. Instead of (137.7), 


we now have 
dk dk, dk, d(x?) 


al Tl aE kn) Qa ~ Bat | Gene 7) 


* The integrand as written is valid only when & is not too large. 
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This integral is logarithmically divergent when k + 0. The divergence of the 
mean square displacement implies that a point to which a particular value of 
e(x) corresponds may be displaced through very large distances; in other 
words, the density o(x)is “smoothed out” through the whole body, so that 
no function e(x) is possible except the trivial case @ = constant. 

Similar arguments for a body with e = o(x, y) give the following expres- 
sion for the mean squares of the displacement: 


x dk, dk, 
(ua), (u2) = oF FS (137.10) 
where ¢ is again a quadratic function of its arguments. This integral is easily 
seen to converge at the lower limit, so that the mean fluctuational displace- 
ment remains finite. Thus bodies having such a structure could in theory 
exist, but it is not known whether they do in fact exist in Nature. 

So far in this section we have been concerned with three-dimensional bod- 
ies, and only the arrangement of the atoms was assumed to have a two- 
dimensional (or one-dimensional) ordering. Let us now consider the possi- 
bility of an ordered configuration of atoms in two-dimensional! systems where 
the atoms occupy only a certain surface.’ The two-dimensional analogue of 
ordinary solid crystals would be a film in which the atoms are regularly ar- 
ranged at the points of a plane lattice. This configuration could be described 
by adonsity function o(x, y) (which now has a different meaning from 
that used above, since only atoms on one surface z = constant are consid- 
ered). It is easy to see, however, that the thermal fluctuations “smooth 
out” such a crystal, so that 9 = Constant is the only possibility: the mean 
values of the products of components of the fluctuational displacement 
u (in the xy-plane) are again determined by expressions of the form (137.6) 
and (137.7), except that the integration is now taken over two-dimensional 
k-space: 


(uj) = r| Auto) oe : (137.11) 


and the integral diverges logarithmically as k — 0. 

Here, however, the following reservation is necessary. The result obtained, 
strictly speaking, means only that the fluctuational displacement becomes 
infinite when the size (area) of the two-dimensional! system increases without 
limit (so that the wave number may be arbitrarily small). But, because of the 
slow (logarithmic) divergence of the integral, the size of the film for which the 


+ Such are monomolecular adsorbed films at the interface between two iso- 
tropic phases; see § 159. 
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fluctuations are still small may be very great.t In such cases, a film of finite 
size might have practically “solid-crystal” properties and be approximately 
describable as a two-dimensional lattice. We shall see in § 138 that such 
properties of two-dimensional systems become more marked at lower tem- 
peratures. 


§ 138. The correlation function in two-dimensional systems 


The expression (137.11) gives the mean square fluctuational displacement at 
any specified point in a two-dimensional crystal system. A fuller under- 
standing of the properties of such systems can be achieved by considering 
the correlation function between the fluctuations at different points in the 
system. 

Let us note first of all that, when T = 0, a two-dimensional lattice of any 
size could exist: the divergence of the integral (137.11) is due to the thermal 
fluctuations (T + 0). Let go(r) be the density function of the system at T = 0.+ 
We can determine the correlation function of the density fluctuations for 
non-zero but sufficiently low temperatures (much less than the Debye tem- 
perature). Under these conditions, only long-wavelength vibrations are ex- 
cited in the lattice, ie. the variation of the density function is determined 
mainly by the long-wavelength fluctuations. 

Let the atoms at the points r in the lattice undergo fluctuational displace- 
ments u(r). If the function u(r) varies only slightly over distances of the order 
of a lattice constant (corresponding to the fluctuations with small wave 
numbers that we are concerned with here), the change in the density at each 
point in space may be regarded as simply the result of a shift of the lattice 
by an amount equal to the local value of the displacement vector. The 
fluctuating density is then written o(r) = eo[r—u(r)], and the correlation 
between its fluctuations at different points r1 and re is determined by the 
mean value 


(e(1) e(r2)) = (eo[t1—u(r1)] golre—u(re)}). (138.1) 
The periodic function go(r) can be expanded as a Fourier series (cf. (133.2)): 
Qo(r) = e+, ere”, (138.2) 


where the b are the vectors of the (plane) reciprocal lattice and the constant 
term @ has been separated from the sum. When these series are substituted 


* The same applies to three-dimensional bodies with one-dimensional period- 
icity, for which the integral (137.9) diverges logarithmically. 

T Here and in the rest of § 138, r = (x, y) is the two-dimensional radius vector 
in the plane of the system. 
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in (138.1) and the result is averaged, the terms containing the products 
Cy With b’ + —b disappear, as will be shown below. The product with 
b’ = —b gives a contribution to (138.1) 


| ob |? exp [ib-(r1—re)] (exp [—ib-(a1—u2))); (138.3) 


for brevity, we write u(r1) = U1, u(re) = Ue. 

The probability distribution for the fluctuations of the displacement 
vector is given by formula (137.2), in which AF, is a quadratic functional 
of u(r). If the values of u(r) for different (discrete) points in space are regarded 
as different fluctuating quantities x, (a = 1, 2, ...), this means that the prob- 
ability distribution for them is Gaussian. We can then use in the averaging 
in (138.3) the formula 


(exp (%aXa)) = exp (Fa0%0(XaXo)) 
(see § 111, Problem), obtaining 


(exp [—ib-(u,—u,)}) = exp ( a 3 bibyxu), (138.4) 
where 
dull) = (Un —t%42) Un —%42)) 
= uu) = (untty2) ne (uit) 


(r = r1—fre). It remains to substitute ui: and ue as the expansions (137.1). 
Noting that the mean values (u,,uy,-) are zero when k’ + —k, and are given 
by (137.11) when k’ = —k, we find 


wate) = 7 [AHP 20 -cos key FEE. (138.5) 


The integral converges for small k, since the factor 1—cos ker oc k? when 
k - 0.' For large k, the integral diverges logarithmically. This divergence 
is actually due only to the fact that the approximations used are invalid for 
large k: when kK Z Kuaxs ACKmnax ~ T (where c is the velocity of sound; 
see § 110), the fluctuations cease to be classical. (At low temperatures this 
condition is violated sooner than k « 1/a, where a is the lattice constant.) 
Noting also that for large k the term containing the rapidly oscillating factor 
cos key in the integrand may be omitted, we find 


Xu(¥) = hi log Kmax!3 (138.6) 


+ If we look into the origin of this factor, we see that it arises from the fact 
that b’ = —b in (138.3). It is easy to see that when b’ + —b there will not be 
cancellation in the integrand and the integral will diverge. Since these integrals 
occur in the exponent (cf. (138.4)), their divergence means that the corresponding 
contributions to the correlation function are zero. 
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the bar over A, denotes averaging over the directions of the vector k in the 
plane. 

The required correlation function is now obtained by substituting (138.6) in 
(138.3) and (138.4) and summing over b; the asymptotic law of decrease of 
this function with increasing distance r is given by the least rapidly mere 
term in the sum: 


(o(11) o(r2))— @ — —— cos ber, 


ee, (138.7) 
= bb, Aj/2x, 


where b is to be taken as the reciprocal lattice basic vector for which «, has 
its least value. 

Thus, in a two-dimensional lattice, the correlation function does tend to 
zero as r - oo (unlike a three-dimensiona! Jattice, where it tends to a finite 
limit), but only according to a power law, and more slowly at low temper- 
atures. Note that the correlation function (138.7) depends on the direction 
of r, and the system therefore remains anisotropic. 

Similar but slightly more laborious calculations give a law of the same type 
for thecorrelation functionina three-dimensional system with density function 
o(x). For comparison, it may be recalled that in an ordinary liquid the correla- 
tion function decreases much more rapidly, in fact exponentially (see § 116). 


§ 139. Symmetry with respect to orientation of molecules 


The condition @ = constant is necessary but certainly not sufficient for a 
body to be isotropic. This is clear from the following example. Let us imagine 
a body consisting of elongated molecules, all positions in space of a molecule 
as a whole (i.e. of its centre of mass) being equally probable, but the axes of 
the molecules being predominantly oriented in one direction. Such a body is 
obviously anisotropic, despite the fact that @ = constant for each atom pres- 
ent in the molecule. 

The property whose symmetry is here under consideration may be formu- 
lated in terms of a mutual correlation between the positions of the different 
atoms. Let 012 dV2 be the probability of finding an atom 2 in the volume 
element dV» for a given position of atom 1 (atoms of different types usually 
being involved); 12 is a function of the coordinates r; and re of the two 
atoms, and the symmetry properties of this function determine the symmetry 
of the body (in which 9 = constant). 


t A correlation function of this form was found by T. M. Rice (1965) for a two- 
dimensional superconductor, and by B. Tancovici (1967) and V. L. Berezinskil 
(1971) for a two-dimensional lattice. 
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The fact that the density function @ is constant signifies that a relative dis- 
placement of parts of the body (without change of volume) does not affect 
the equilibrium state of the body, i.e. does not change its thermodynamic 
quantities. This is precisely the characteristic property of liquids (and gases). 
We must therefore regard bodies with @ = constant and an anisotropic 
correlation function 012 as forming a distinct category, liquid crystals (an- 
isotropic fluids). These include bodies whose molecules are anisotropically 
distributed in respect of their spatial orientation. 

Two types are possible as regards the symmetry of this distribution. For 
one type (nematic liquid crystals) the correlation function depends only on 
the differetite Tis = 11—Tra; when the length of this vector varies without 
change in its direction, no periodicity of the correlation function ‘occurs, 
though it may undergo oscillations which become less as rye increases. 
The function thus has no translational symmetry, and its symmetry group 
must consist of various rotations and reflections, i.e. is a point group. 
Geometrically, it may be any point group, with symmetry axes of any order, 
but apparently all the known nematic liquid crystals have an axis of complete 
axial symmetry, and the two directions along this axis are equivalent. The 
point groups C,,, D.. and D,,, have these properties.’ We shall see in 
§ 140, however, that the symmetry D,, (which has no plane of symmetry) 
makes the state of the liquid crystal unstable, and there consequently must 
appear a certain “secondary” periodic structure typical of liquid crystals of 
the other (cholesteric) type. 

Besides these two types, there exist other anisotropic liquids with various 
layered structures, which are usually classed together as smectic liquid crys- 
tals. It seems that at least some of these have a density function o(X) that is 
periodic in one direction only. Such substances may be regarded as consist- 
ing of freely moving plane layers at equal distances apart. In each layer the 
molecules have an ordered orientation, but the configuration of their centres_ 
of mass is random. 

~~ Te has Béén StOwn in § 137 that structures with a one-dimensional perio- 
dicity of the density function are smoothed out by thermal fluctuations. 
The divergence of these fluctuations is, however, only logarithmic. Although 
this excludes the possibility of a one-dimensional periodicity extended to 
arbitrarily large distances, it does not exlude its existence over comparatively 
small but nevertheless macroscopic regions of space, as already noted at the 
end of § 137. 

Finally, it may be mentioned that in ordinary isotropic liquids also there 
are two different types of symmetry. If the liquid consists of a substance 
which does not have stereoisomers, it is completely symmetrical not only 


t In the other groups with axial symmetry (C,, and C,,,) the two directions along 
the axis are not equivalent. Such liquid crystals would in general be pyroelectric. 
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under a rotation through any angle about any axis but also under a reflection 
in any plane, i.e. its symmetry group is the complete group of rotations about 
a point, together with a centre of symmetry (group K,). If the substance has 
two stereoisomeric forms, however, and the liquid contains different num- 
bers of molecules of the two isomers, it will not possess a centre of symmetry 
and therefore will not allow reflections in planes. Its symmetry group is just 
the complete group of rotations about a point (group K). 


§ 140. Nematic and cholesteric liquid crystals 


The orientational symmetry of nematic liquid crystals is uniaxial: at every 
point in the liquid there is only one preferred direction of orientation of the 
molecules, namely that of axjal symmetry. The macroscopic state of such a 
body can therefore be described by specifying one unit vector n(r) at each 
point, having this preferred direction; it is called the director. In a complete 
equilibrium state the body is homogeneous, i.e. 2 = constant. Inhomogene- 
ous distributions n(r) describe various deformational states of the liquid 
crystal. 

In a macroscopic deformation, n(r) varies slowly through the body; the 
characteristic dimensions of the deformation are large compared with molec- 
ular dimensions. The derivatives of n(r) with respect to the coordinates are 
therefore small quantities of an order that increases with that of the deriva- 
tive. Representing the total free energy of the deformed liquid crystal (at a 
given temperature) as an integral F, = | Fay, we can expand the free 
energy density F in powers of the derivatives of a(r) (C. W. Oseen, 1933; 
F. C. Frank, 1958). 

The expansion of the scalar quantity F can contain only scalar combina- 
tions of the components of the vector n and its derivatives. There are only 
two scalar combinations linear in the first derivatives: the true scalar div n 
and the pseudoscalar n-curl n. The former, on integration over the volume, 
becomes an integral over the surface of the body, and is therefore unimpor- 
tant in considering the bulk properties of the substance. 

True scalars quadratic in the first derivatives can be found by taking the 
rank-four tensor 

Ong On) 
Ox; OXm 


and forming invariants from it by contraction with respect to pairs of 
suffixes or multiplication by components of the vector n. It must be noted 
here that n is a unit vector, so that 
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In this way we find the invariants 


2 On On, : 9 Om On; 
[(negrad)n]’, Ox, Ox” (div n)’, On Okc. 


The last two, however, differ only by a divergence: 


it Cie Se OB: 3 0 [Oe OL 
Ox; OX~ OX; OXe Ox; ( ‘Ox, a) 


so that their contributions to the total free energy differ only by an integral 
over the surface of the body, which is of no interest here (J. L. Ericksen, 
1962). The invariant‘ 
oo ee = (o-curl n)? + (div nn)’, 

and (necur! n)? may therefore be taken as the independent invariant. Lastly, 
there is a pseudoscalar quadratic in the first derivatives: (necurl n) div n.t 

Scalars linear in the second derivatives are of the same order of smallness 
as the above, but integration by parts reduces all of them to forms quadratic 
in the first derivatives. 

Thus we have the following expression for the free energy density of a 
liquid crystal: 


F = Fo+bn-curl n+ 4a;(div a)? + $40(n-curl a)? 
+ +4as{(a-grad)n]?+ @1o(n-curl n) div n, (140.1) 


where b, a1, a2, a3 and aig are (temperature-dependent) constants. 

As already mentioned in § 139, the directions of n and —n are equivalent 
in all known liquid crystals of these types; to meet this condition, we must 
put ay2 = 0. Moreover, if the symmetry elements of the crystal include 
planes, b = 0: since necurl n is a pseudoscalar and the free energy is a true 
scalar, the coefficient b must be a pseudoscalar; but a medium having planes 
of symmetry cannot have pseudoscalar characteristics, since reflection in the 
' plane would give b = —b. Thus the free energy of a nematic liquid crystal is 


F = Fo+$ai(div a)? + $a2(n-curl n)?+4a3[(m-grad)a]?. (140.2) 


All three coefficients a1, ae, a3 must be positive. Then n = constant corre- 
sponds to the equilibrium state. 


+ This is easily proved by expressing in components and taking the z-axis in the 
direction of n at the relevant point in space (then O0n,/Ox, = 0). 

1 The product (negrad)n-curl n= 0, since from grad n? = 0 it follows that 
(n-grad)n == —nxcurl! n. 
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If the liquid crystal does not have a plane of symmetry, then b # 0.' We 
can rewrite (140.1) (with ai2. = 0) as 


F = Fo+4a;(div a)? + 4a2(n-curl n+9o)?+ 4as[(m-grad)n]?, (140.3) 


where go = b/a2 (and the constant — b?/2a¢ is included in Fo). The equilibrium 
state of such a substance corresponds to a distribution of directions of the 
director such that 


diva = 0, (m-gradn = 0, n-curln = —qo. 
These equations have the solutions 
nN, = 0, my =COSGox, Nz = SiN Gox. (140.4) 


This structure (corresponding to cholesteric liquid crystals) can be imag- 
ined as the result of a uniform twisting about the x-axis of a nematic medium 
originally oriented with n = constant in one direction in the yz-plane. 
The orientational symmetry of a cholesteric crystal is periodic in one direc- 
tion (the x-axis) in space, so that the correlation function 012 = 012(x, r12). 
The vector n reverts to its original value after each interval of length 27/qo 
along the x-axis; but since the directions of a and —n are physically equiva- 
lent, the actual period of repetition of the structure is is t/qo. A structure of 
this type is usually said to be helicoidal. 

The above theory is, of course, valid only if the period of the helicoidal 
structure is large in comparison with molecular dimensions. This condition 
is in fact satisfied in cholesteric liquid crystals (the period 2/q ~ 107 s * cm). 


§ 14].. Fluctuations in liquid crystals 


Let us consider the fluctuations of the direction of the director n in a nematic 
liquid crystal (P. G. de Gennes, 1968). 

We express n as No +v, where a9 = fi is the equilibrium direction, constant 
throughout the volume, and vy = An is the fluctuation from this value. Since 
n” = n? = 1, nop = 0, ie. the vector p is perpendicular to nj. Accordingly, 
the correlation function of the fluctuations, 


(va(F1) Yp(F2))s (141.1) 


is a two-dimensional tensor in the plane perpendicular to mo (a and £ being 
vector suffixes in that plane). In a homogeneous but anisotropic liquid, this 


t This symmetry will always occur for a liquid crystal consisting of one stereoiso- 
mer of a substance that has molecules with mirror-type asymmetry (such as all the 
known cholesteric liquid crystals). Crystals consisting of two different stereoiso- 
mers of the same substance differ in the sign of the constant b. 
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function depends not only on the magnitude of the vector r = rg—ri but 
also on its direction. 

A magnetic field has a strong influence on the fluctuations of the director. 
This is due to the presence, in the free energy density of a liquid crystal, of a 


further term 
Frnagn = —37{n-H)*, (141.2) 


which depends on the vector n itself, not on its derivatives as in (140.2). 
If y, > 0, the equilibrium direction of n is the same as that of the field, but 
if 7, < O it is in the plane perpendicular to the field. We shall take the case 
where 7, > 0, so that ny||H. Then (n-H)? = H°(1— v); omitting the term 
independent of v, we can write 

Fraga = F%qlT*v? (141.3) 


Taking F from (140.2) and (141.3), and retaining only quantities of the 
second order in v, we get the following expression for the change in the 
total free energy in a fluctuation: 


AF, = + § {a,(div v)? + a,(curl, v)? + a,(v/Ox)? + 4qH?v"} dV, (141.4) 


with the x-axis in the direction of no. It must be emphasised that, by using 
the expression (140.2) for the energy of the deformed crystal, we thereby 
limit the treatment to fluctuations with wavelengths much longer than 
molecular dimensions. 

We now proceed as in § 116, expressing the fluctuating quantity v(r) as a 
Fourier series in the volume V: 


v= pens, Vie = VE. (141.5) 


After substitution of this series, the expression (141.4) forms a sum of 
terms (4F,),, each of which depends only on the component », with a partic- 
ular value of k. Taking the xy-plane to contain the direction of k (and of 
H), we get 
(AF ye = EV {(ark? + ak? + aH?) | ry? 
+ (ack? + ask? + Nall’) Vex |?}. 
Hence (cf.§ 116) we find for the mean square fluctuations 
(| Pel?) = T/V (aik} + ask + Yall”), 
(| P|®) = T/V (aok} + oak + Yall”), (141.6) 
(VycVste) = Q, 
t In a uniaxial anisotropic medium, the magnetic susceptibility is a tensor 
Lie = XoO+Xaltity, and the magnetisation of the substance gives rise to a free 


energy term — 3%, The quantity (141.2) is the part of this contribution that 
depends on n. 
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We see that, in the absence of the field, the fluctuations of the Fourier 
components py, increase without limit as k -- 0 (the integrals over d3k, which 
give the mean square of the vector »v itself, remain finite, however). The 
application of the magnetic field suppresses the fluctuations that have wave 
numbers k < H(y,/a)'* (where ais the order of magnitude of the coefficients 
1, Ga, as). 

The correlation function (141.1) can be calculated from (141.6), using the 


formula 
(ve(¥1) Yate) = fe *(vavheyV dk /(27)? ; (141.7) 


cf. (116.13). We shall not pause to give the fairly lengthy formula resulting 
from the integration,+ but only mention that, in the absence of the field, the 
correlation function decreases as 1/r with increasing distance r = |ra—ry]. 
When the field is present, the decrease becomes exponential, with correlation 
radius r, ~ (a/z,)'" 471. 

The fluctuations of the direction of the director in a cholesteric liquid 
crystal can be treated similarly, and only a brief commentary will be given 
here. 

In a cholesteric medium, we can distinguish the fluctuations of the local 
direction of the helicoidal structure axis and those of the phase (the angle of 
rotation of the vector n about this axis). Fluctuations of the first type are 
finite, but the mean square fluctuation of the phase is (in the absence of a 
magnetic field) logarithmically divergent as k -- 0. In this respect, the fluc- 
tuations in a medium with one-dimensional periodicity of the orientational 
structure are analogous to those in a medium with one-dimensional perio- 
dicity of the particle configuration (§ 137). Strictly speaking, such periodicity 
is consequently impossible in a medium of arbitrarily great extent, but, 
on account of the large value of the helicoidal structure period in cholesteric 


t This type of fluctuation is analogous to the behaviour of the density fluctua- 
tions in an ordinary liquid near its critical point and that of the order parameter 
fluctuations near a phase transition point of the second kind (see § 146 and 152 
below). Whereas in these latter cases the factor that suppresses the fluctuations is 
the “distance” from the points concerned, here it is the external magnetic field, 
which is independent of the temperature. Note that it is precisely the increase of 
the fluctuations of n when k is small that enables us to treat these fluctuations 
independently of those of other quantities. In this connection, it is important that 
we are not considering the neighbourhood of phase transition points of the second 
kind. Near such points, the fluctuations of other characteristics of the transition 
also increase, and the fluctuations of n cannot in general be dealt with independently 
of the others. It should also be emphasised that the increase of the fluctuations 
does not lead to any limitations on the range of applicability of (141.6), whereas 
the validity of formula (146.8), for example, is subject to the inequality (146.15). 


t In deriving this, the expressions (141.6) must, of course, be rewritten in a form 
that does not involve a particular choice of the coordinate axes. 
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liquid crystals, the divergence of the fluctuations would occur only for such 
huge dimensions that the whole question is a purely academic one. 

We may add a few words about fluctuations in smectic liquid crystals 
consisting of regularly arranged plane layers. As already mentioned in 
§ 139, such a structure is smoothed out by thermal fluctuations, and can 
therefore occur only in restricted volumes. It is interesting to note, however, 
that these fluctuations are suppressed by a magnetic field. The origin of this 
effect is as follows. 

In each layer, the molecules are oriented in an ordered manner, with a 
preferred direction given by the director n; let this be normal to the surface 
of the layer. When fluctuations occur, the surface of the layers is deformed 
and the director is rotated. Let u be the displacement vector of points in a 
layer, and y again the change in the director (n = no +). In long-wavelength 
deformations, the layer may be regarded as a geometrical surface, and the 
small quantities y and u are then related by y = —grad (uemo) (the change of 
direction of the normal to the surface); for their Fourier components we have 
v,, = —ix(u,,°y), where # is the component of k in the plane of the layer. 
In the presence of the magnetic field, the change in direction of the director 
makes an additional contribution (141.3), proportional to v*, in AF,. This 
in turn has the result that the integral (137.9) which gives the mean square 
of the fluctuational displacement acquires a further term ~ x? in the denomi- 
nator of the integrand (together with a term in x*), and in consequence the 
integral no longer diverges. 


Lastly, let us consider fluctuations in two-dimensional liquid-crystal 
systems (films). In such a system, the orientation of the molecules is 
specified by a director n lying in the plane of the film. If we consider 
its fluctuations (with wave vectors k in the plane of the film), we get an 
expression analogous to (141.6): in the absence of a field, (x) oc 1/(k,, k,), 
where $(k,, k,) is a quadratic function of the components of k. But to 
find the total fluctuation (vy?) this expression must now be integrated over 
d?k oc k dk, and the integral diverges logarithmically. Thus the thermal 
fluctuations smooth out such a two-dimensional liquid-crystal structure. 
As with a two-dimensional solid crystal (§ 137), however, the logarithmic 
divergence does not exclude the existence of such a structure in regions of 
finite size. 


CHAPTER XIV 


PHASE TRANSITIONS OF THE SECOND KIND 
AND CRITICAL PHENOMENA 


§ 142. Phase transitions of the second kind 


Ir was already been mentioned in § 83 that the transition between phases of 
different symmetry (crystal and liquid; different crystal modifications) 
cannot occur in a continuous manner such as is possible for a liquid and a 
gas. In every state the body has either one symmetry or the other, and there- 
fore we can always assign it to one of the two phases. 

The transition between different crystal modifications is usually effected by 
means of a phase transition in which there is a sudden rearrangement of the 
crystal lattice and the state of the body changes discontinuously. As well as 
such discontinuous transitions, however, another type of transition involv- 
ing a change of symmetry is also possible. 


series 
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To elucidate the nature of these transitions, let us consider a specific 
example. At high temperatures, BaTiOs has a cubic lattice whose unit cell is 
as shown in Fig. 60 (the barium atoms are at the vertices, the oxygen atoms 
at the centres of the faces, and the titanium atoms at the centres of the cells). 
As the temperature decreases below a certain value, the titanium and oxygen 
atoms begin to move relative to the barium atoms parallel to an edge of the 
cube. It is clear that, as soon as this movement begins, the symmetry of the 
lattice is affected, and it becomes tetragonal instead of cubic. 

This example is typical in that there is no discontinuous change in state of 
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the body. The configuration of atoms in the crystal’ changes continuously. 
However, an arbitrarily small displacement of the atoms from their original 
symmetrical positions is sufficient to change the symmetry of the lattice. The 
resulting transition from one crystal modification to another is called a 
phase transition of the second kind, in contrast to ordinary phase transitions, 
which in this case are said to be of the first kind.t 

Thus a phase transition of the second kind is continuous in the sense that 
the state of the body changes continuously. It should be emphasised, how- 
ever, that the symmetry, of course, changes discontinuously at the transition 
point, and at any instant we can say to which of the two phases the body 
belongs. But whereas at a phase transition point of the first kind bodies in 
two different states are in equilibrium, the states of the two phases are the 
same at a transition point of the second kind. 

As well as cases where the change in symmetry of the body occurs by a 
displacement of the atoms (as in the example given above), the change in 
symmetry in a phase transition of the second kind may result from a change 
in the ordering of the crystal. It has already been mentioned in § 64 that the 
concept of ordering arises if the number of lattice points that can be occu- 
pied by atoms of a given kind exceeds the number of such atoms. We shall 
use the word “own” for the places occupied by atoms of the kind in question 
in a completely ordered crystal, in contrast to the “other” places which are 
taken by some of the atoms when the crystal becomes disordered. In many 
cases, which will be of interest in connection with transitions of the second 
kind, it is found that the “own” and “other” lattice sites are geometrically 
identical and differ only in that they have different probabilities of contain- 
ing atoms of the kind in question.’ If now these probabilities become equal 
(they will not be unity, of course), all such sites become equivalent, and there- 
fore new symmetry elements appear, i.e. the symmetry of the lattice is in- 
creased. Such a crystal will be said to be disordered. 

The foregoing may be illustrated by an example. The completely ordered 
alloy CuZn has a cubic lattice with the zinc atoms at the vertices, say, and the 
Copper atoms at the centres of the cubic cells (Fig. 61a; a simple cubic Bra- 
vais lattice). When the alloy becomes disordered, copper and zinc atoms 


t To simplify the discussion, we shall conventionally speak of the configuration 
of the atoms or its symmetry as if the atoms were at rest. In reality we should 
speak of the probability distribution for various configurations of the atoms in 
space, and of the symmetry of this distribution. 

t Phase transition points of the second kind are also called Curie points, especi- 
ally when they are associated with a change in the magnetic structure of the body. 

§ We may note that in this case it can always be assumed that the probability 
of finding an atom at one of its own sites is greater than at one of the other sites 
simply because, if it were not, we could transpose the nomenclature of the sites. 
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change places, i.e. non-zero probabilities of finding atoms of either kind 
exist at every lattice site. Until the probabilities of finding copper (or zinc) 
atoms at the vertices and at the centres of the cells become equal (that is, 
while the crystal is not completely ordered) ,these sites remain non-equiv- 
alent, and the symmetry of the lattice is unchanged. But when the probabil- 
ities become equal, all sites become equivalent, and the symmetry of the 
crystal is raised: a new lattice vector appears, from a vertex to the centre of 
a cell, and the crystal acquires a body-centred cubic Bravuais lattice (Fig. 
61b).7 


So far we have discussed only transitions between different crystal modi- 
fications, but phase transitions of the second kind need not necessarily in- 
volve a change in symmetry of the configuration of atoms in the lattice. 
A transition of the second kind can also bring about a transformation be- 
tween two phases differing in some other property of symmetry, as for exam- 
ple at the Curie points of ferromagnetic or antiferromagnetic substances. 
In this case there is a change in symmetry of the arrangement of the elemen- 
tary magnetic moments in the body, or more precisely a disappearance of 
the currents j in it; see the first footnote to § 128. Other phase transitions 
of the second kind are the transition of a metal to the superconducting state 
(in the absence of a magnetic field) and that of liquid helium to the super- 
fluid state. In both these cases the state of the body changes continuously, 
but it acquires a qualitatively new property at the transition point. 

Since the states of the two phases are the same at a transition point of the 
second kind, it is clear that the symmetry of the body at the transition point 
itself must contain all the symmetry elements of both phases. It will be 


t Cases are in principle possible where the occurrence of ordering does not cause 
a change in the symmetry of the crystal. A phase transition of the second kind is 
then impossible: even if the transition from the ordered to the disordered state of 
the crystal were to occur continuously, there would still be no discontinuity of 
specific heat (see below). In such cases a phase transition of the first kind is, of 
course, possible. 
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shown below that the symmetry at the transition point itself is the same as 
the symmetry everywhere on one side of that point, i.e. the symmetry of one 
of the phases. Thus the change in symmetry of the body in a phase transition 
of the second kind has the following very important general property: 
the symmetry of one phase is higher than that of the other.’ It should be 
emphasised that in a phase transition of the first kind the change in symmetry 
of the body is subject to no restriction, and the symmetries of the two phases 
may be unrelated. 

In the great majority of the known instances of phase transitions of the 
second kind, the more symmetrical phase corresponds to higher tempera- 
tures and the less symmetrical one to lower temperatures. In particular, a 
transition of the second kind from an ordered to a disordered state always 
occurs with increasing temperature. This is not a law of thermodynamics, 
however, and exceptions are therefore possible.? 

For brevity we shall arbitrarily call the more symmetrical phase simply 
the symmetrical one, and the less symmetrical phase the unsymmetrical one. 

To describe quantitatively the change in the structure of the body when it 
passes through the phase transition point, we can define a quantity 7, 
called the order parameter, in such a way that it takes non-zero (positive 
or negative) values in the unsymmetrical phase and is zero in the symmetrical 
phase. For instance, in transitions where the atoms are displaced from their 
positions in the symmetrical phase, 7 may be taken as the amount of this 
displacement. For transitions with a change in the ordering of the crystal 
(e.g. in the CuZn alloy mentioned above), the parameter 7 may be defined 
as 

1 = (Wcu—Wza)/(Wou + Wza), 


where Wo, and wz, are the probabilities of finding a copper atom and a zinc 
atom respectively at any given lattice site. For magnetic transitions, 7 may 
be taken as the macroscopic magnetic moment per unit volume of a ferro- 
magnet or the magnetic moment of the sub-lattice for an antiferromagnet. 

It must again be emphasised that the symmetry of the body is changed 
(namely, increased) only when 7 becomes exactly zero; any non-zero value 
of the order parameter, however small, brings about a lowering of the 


t It will be recalled that the term “higher symmetry” refers to a symmetry which 
includes all thesymmetry elements (rotations, reflections and translational periods) 
of the lower symmetry, together with additional elements. 

The condition mentioned is necessary but not sufficient for a phase transition 
of the second kind to be possible; we shall see later that the possible changes of sym- 
metry in such a transition are subject to further restrictions. 

t One exception, forexample, is the “lower Curie point” of Rochelle salt. below 
which the crystal is orthorhombic, but above which it is monoclinic. 
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symmetry. A passage through a phase transition point of the second kind 
has a continuous change of 7 to zero. 

The absence of any discontinuous change of state at a phase transition 
point of the second kind has the result that the thermodynamic functions of 
the state of the body (its entropy, energy, volume, etc.) vary continuously as 
the transition point is passed. Hence a phase transition of the second kind, 
unlike one of the first kind, is not accompanied by evolution or absorption 
of heat. We shall see, however, that the derivatives of these thermodynamic 
quantities (i.e. the specific heat of the body, the thermal expansion coefficient, 
the compressibility, etc.) are discontinuous at a transition point of the second 
kind. 

We must remember that mathematically a phase transition point of the 
second kind is a singularity of the thermodynamic quantities, and in partic- 
ular of the thermodynamic potential ®; the nature of this singularity is 
discussed in §§ 148 and 149. In order to see this, let us first recall that a 
phase transition point of the first kind is not a singularity; it is a point at 
which the thermodynamic potentials ®,(P, 7’) and ®2(P, 7) of the two phases 
are equal, and each of the functions ®; and ®z on either side of the transition 
point corresponds to an equilibrium (though possibly metastable) state of 
the body. In a phase transition of the second kind, however, the thermody- 
namic potential of each phase, if formally regarded on the far side of the 
transition point, corresponds to no equilibrium state, i.e, to no minimum of 
®; we shall see in§ 143 that the thermodynamic potential of the symmetrical 
phase would indeed correspond to a maximum of ® beyond the transition 
point. 

This last result implies that superheating and supercooling effects are 
impossible in phase transitions of the second kind (whereas they can occur 
in ordinary phase transitions). In this case neither phase can exist beyond 
the transition point (here we ignore, of course, the time needed to establish 
the equilibrium distribution of atoms, which in solid crystals may be con- 
siderable). 


PROBLEM 


Let c be the concentration of atoms of one component of a binary solid solu- 
tion, and cy the concentration of these atoms’ “own” sites. If c + cy the crystal 
cannot be completely ordered. Assuming the difference c—c, small and the crystal 
almost completely ordered, determine the concentration 2 of atoms at “other” 
sites, expressing it in terms of the value A) which it would have at c = ey for given 
P and T (C. Wagner and W. Schottky, 1930). 


SOLUTION. Considering throughout only the atoms of one component, we use the 
concentration A of atoms at other sites and the concentration 2’ of their own sites 
not occupied by these atoms; concentrations are defined with respect to the total 
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number of all atoms in the crystal. Clearly 
C—Cy = AN, (1) 
We shall regard the crystal as a “solution” of atoms at other sites and of own 
sites not occupied by atoms, the “solvent” being represented by atoms at their 
“own” sites. The transition of atoms from “other” to their “own” sites can then 
be regarded as a “chemical reaction” between the “solutes” (with small concentra- 
tions 2 and 1’) to form the “solvent” (with concentration = 1). Applying to this 
“reaction” the law of mass action, we obtain Aw’ = K, where K depends only on 
P and T. For c = cy we must have A = A’ = dy; hence K = 22, and so 


AA Sa (2) 
From (1) and (2) we find the required concentrations: 
A= 3[(c—eg) +-V{(e— eo? +429}, 
N= $[—(c—) +0/{(o— 0)? +4433]. 
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The quantitative theory of phase transitions of the second kind is based 
on a consideration of the thermodynamic quantities of the body for given 
deviations from the symmetrical state (i. e. for given values of the order 
parameter 7); we can represent the thermodynamic potential of the body, 
for example, as a function of P, T and n. Here it must of course be remem- 
bered that in the function ®(P, T, 7) the variable 7 is in one sense not on the 
same footing as the variables P and T; whereas the pressure and temperature 
can be specified arbitrarily, the value of 7 which actually occurs must itself 
be determined from the condition of thermal equilibrium, i.e. the condition 
that © is a minimum (for given P and T). 

The continuity of the change of state in a phase transition of the second 
kind is expressed mathematically by the fact that the quantity 7 takes arbi- 
trarily small values near the transition point. Considering the neighbourhood 
of this point, we expand ©(P, T, 7) in powers of 7: 


O(P, T, n) = Potan+Arn?+Cre+Brit+ ..., (143.1) 


where the coefficients «, A, B, C, ... are functions of P and T. 

It must emphasised, however, that writing ® as the regular expansion 
(143.1) does not take account of the fact already mentioned that the tran- 
sition point is a singularity of the thermodynamic potential; the same is true 
of the later expansion of the coefficients in (143.1) in powers of the tempera- 
ture. This section and §§ 144-146 are concerned with a theory’ based on the 


t This theory is due to L. D. Landau (1937), who also first showed the general 
relation between phase transitions of the second kind and the change in the sym- 
metry of the body. 
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validity of such expansions; the conditions for it to be applicable will be 
considered in § 146. 

It can be shown (see§ 144) that, if the states with 7 = O and 7 + O are of 
different symmetry (as we assume), the first-order term in the expansion 
(143.1) is identically zero: « = 0. The coefficient A(P, 7) in the second-order 
term is easily seen to vanish at the transition point, since in the symmetrical 
phase the value 7 = 0 must correspond to a minimum of @, and for this to 
be so it is evident that A > 0 is necessary, while on the other side of the 
transition point, in the unsymmetrical phase, non-zero values of 1 must 
correspond to the stable state (i. e. to the minimum of ®), and this is possible 
only if A < 0; Fig. 62 shows the form of the function ®(7) for 4 < 0 and 
A > 0. Since A is positive on one side of the transition point and negative 
on the other, it must vanish at the transition point itself. 


® 
| 
| 
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But if the transition point itself is a stable state, i. e. if ® as a function of 
n is a minimum at 7 = 0, it is necessary that the third-order term should be 
zero and the fourth-order term positive there: 


A{P,T)=0, C(P,T)=0, BA(P,T) > 0, (143.2) 


where the suffix c refers to the transition point. 

The coefficient B, being positive at the transition point, is of course also 
positive in the neighbourhood of that point. 

Two cases can occur. In one, the third-order term is identically zero owing 
to the symmetry of the body: C(P, T) = 0. Then there remains at the 
transition point only the one condition A(P, T) = 0, which determines P as a 
function of T or vice versa. Thus in the PT-plane there is a line of phase 
transition points of the second kind.* 


1 This condition, however, needs to be made more precise; see the sixth footnote 
to§ 145. 
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If, however, C is not identically zero, the transition points are deter- 
mined by the two equations A(P, 7) = 0, C(P,T) = 0. In this case, therefore, 
the continuous phase transitions can occur only at isolated points.' 

The most interesting case is, of course, that where there is a line of con- 
tinuous-transition points. In what follows we shall take the discussion of 
phase transitions of the second kind to refer only to this case, which will now 
be considered. It includes, in particular, transitions due to the appearance 
or disappearance of a magnetic structure. This is because of the symmetry 
under time reversal. The thermodynamic potential of the body cannot be 
altered by this transformation, whereas the magnetic moment (which here 
acts as the order parameter) changes sign. It is therefore clear that in such 
cases the expansion of ® contains no odd-order terms. 

We shall therefore suppose that C = 0 and the expansion of the thermo- 
dynamic potential has the form 


OP, T, 1) = Do P, T)+ ACP, Ty? + BCP, Trt. (143.3) 


Here B > 0, while the coefficient A > 0 in the symmetrical phase and A < 0 
in the unsymmetrical phase; the transition points are determined by the 
equation A(P, 7) = 0. 

In the theory given here, it is assumed that the function A(P, T) has no 
singularity at the transition point, so that it can be expanded near this point 
in integral powers of the “distance” from the transition: 


A(P, T) = a(P)(T —T.), (143.4) 


where T, = T,(P) is the transition temperature. The coefficient B(P, 7) may 
be replaced by B(P, T,). The expansion of the thermodynamic potential 
therefore becomes 


P(P,T) = Dol P, T)+a(P)(T-T.)n? + B(P)n', (143.5) 


with BCP) > 0. 

The dependence of 7 on the temperature near the transition point, in the 
unsymmetrical phase, is determined from the condition for ® to be a mini- 
mum as a function of 7. Equating the derivative 0©/0y to zero, we obtain 
n(A +2Br?) = 0, and hence 


7? = —A/2B = a(T,—T)/2B: (143.6) 


* It can be shown (L. Landau, Zhurnal éksperimental’not i teoreticheskoi fiziki 
7, 627, 1937; translation in Collected Papers of L. D. Landau, p.209, Pergamon, 
Oxford, 1965) that there is always a third-order term in the expansion for the 
transition between an isotropic liquid and a solid crystal. 
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the solution 7 = 0 with A < 0 corresponds to a maximum of ®, not a min- 
imum. It should be noted that the configuration of the two phases on the 
temperature scale depends on the sign of a: for a > 0 and a < 0, the un- 
symmetrical phase corresponds to the temperatures T< T, and T > T, 
respectively. We shall assume below the particular case where the symmet- 
rical phase is at T > T,, as happens in the great majority of cases, and 
accordingly that a > 0. 
Neglecting higher powers of 7, we find for the entropy 


S = —6/aT = Sy—(@A/OT)r? ; 


the term containing the temperature derivative of n is zero, because 
d0/on = 0. In the symmetrical phase, 7 = 0 and S = So; in the unsym- 
metrical phase, 

S = So+(a?/2B) (T—T,). (143.7) 


At the transition point itself, this expression becomes So, and the entropy is 
therefore continuous, as it should be. 

Finally, let us determine the specific heats C, = T(0S/8T)p of the two 
phases at the transition point. For the unsymmetrical phase we have, differ- 
entiating (143.7), 
Cy = Coo +a°T/2B. (143.8) 


For the symmetrical phase S = Sj, and therefore C, = C,). Thus the spe- 
cific heat is discontinuous at a phase transition point of the second kind. 
Since B > 0, C, > C,, at the transition point, i.e. the specific heat increases 
in going from the symmetrical to the unsymmetrical phase, whatever their 
respective positions on the temperature scale. 

Other quantities besides C, are discontinuous: C,, the thermal expansion 
coefficient, the compressibility, etc. There is no difficulty in deriving relations 
between the discontinuities of all these quantities. First of all we note that 
the volume and the entropy are continuous at the transition point, i.e. 
their discontinuities AV and AS are zero: 


AvV=0, AS=0. 


We differentiate these equations with respect to temperature along the curve 
of transition points, i.e. assuming the pressure to be the function of tem- 
perature given by this curve. The result is 


A(8V /8T)p-+(4P/4T) A(8V /OP)r = 0, (143.9) 
AC, /T—(dP/AT) A(8V /8T)p = 0, (143.10) 


since (AS/0P), = —(OV/OT)p. These two equations relate the discontinuities 
of the specific heat C,, the thermal expansion coefficient and the compressi- 


§ 143 The Discontinuity of Specific Heat 455 


bility at a phase transition point of the second kind (W. H. Keesom and 
P. E. Ehrenfest, 1933). 

Differentiating along the curve of transition points the equations 4S = 0 
and AP = 0 (the pressure is, of course, unchanged in the transition), but with 
temperature and volume as independent variables, we find 


A(O8P/0T)y + (dV /dT) A(OP/aV)r = 0, (143.11) 
AC,/T + (dV /dT) A(OP/8T)y = 0. (143.12) 
It may be noted that 
AC, = T(dP/dT)* A(—8V/8P)r, (143.13) 
AC, = —T(dV/dTY A(—OV /OP)z', (143.14) 


so that the discontinuities of the specific heat and the compressibility have 
the same sign. From the previous statement about the discontinuity of the 
specific heat, it follows that the compressibility decreases discontinuously 
on going from the unsymmetrical to the symmetrical phase. 

To conclude this section, let us return to the first part and consider the 
meaning of the function ®(P, 7, 7). 

The formal introduction of this function for any values of 7 does not, 
in general, require the possible existence of actual macroscopic states (par- 
tial equilibria) corresponding to such values. We must emphasise, however, 
that such states do in fact exist near a phase transition point of the second 
kind: as the transition point is approached, the minimum of @ as a function 
of 7 becomes steadily flatter. This means that the “restoring force” that 
tends to bring the body to the state having the equilibrium value of 7 be- 
comes steadily weaker, so that the relaxation time for the establishment of 
equilibrium with respect to the order parameter increases without limit (and 
Certainly becomes much longer than the time for equalisation of pressure 
throughout the body). 


PROBLEM 


Find the relation between the discontinuities of specific heat and heat of solution 
in a transition of the second kind in a solution (I. M. Lifshitz, 1950). 


SOLUTION. The heat of solution per molecule of solute is given by q = 
OW /On—wo, where W is the heat function of the solution and wg the heat func- 
tion per particle of the pure solute. Since w, is not affected by the phase transition 
in solution, we have for the discontinuity of q 

o® 


Aq = A@QW/8n) = Ae (ors) = —TAO*®/On OT), 
n oT 


where we have used the fact that the chemical potential u’ = 8©/dn is continuous 
at the transition. On the other hand, differentiation of the equation 4(6®/0T) = 0 
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(continuity of entropy) along the curve of the transition temperature as a function 
of the concentration c at constant pressure gives 


dT, ,2@ OD 
ae are +4 aap = © 


Hence we have the required relation 
N Aq = (AT,/de) AC, : 


We may note that in the derivation of this relation no assumption has been made 
concerning the concentration of the solution. 


§ 144, Effect of an external field on a phase transition 


Let us now consider how the properties of a phase transition change when 
the body is subjected to an external field whose action depends on the value 
of the parameter 7. Without specifying the physical nature of this field, 
we may formulate in general terms the assumptions made about it. These 
amount to asserting that the application of the field is described by the 
appearance in the Hamiltonian of the body of a perturbing operator having 
the form 
A, = —Hhv, (144.1) 
which is linear in the field “strength” A and in the operator 4 of the quantity 
n; V is the volume of the body." If the thermodynamic potential is defined 
as a function of P, T and h, the mean (equilibrium) value of 7 is given by 
Vi = —O@(P, T, h)/oh, (144.2) 
according to the theorem of differentiation with respect to a parameter; cf. 
(11.4) and (15.11). 
In order to ensure that this relation is satisfied in the Landau theory, we 
must add to the expansion (143.5) a term —nhV: 


O(P,T, 4) = Oo P, T) + atn? + Brt—nhvV, (144.3) 
with the notation ¢ = T—T,(P).4 


t For example, in a ferromagnet near its Curie point (the transition to the para- 
magnetic phase), the parameter 7) is the macroscopic magnetic moment per unit 
volume, and the field / is the magnetic field; in a ferroelectric, 7 is the electric dipole 
moment per unit volume of the body, and A the electric field. In other cases, the 
field 4 need not have a direct physical significance, but its formal introduction 
helps towards a fuller understanding of the properties of the phase transition. 

+ In the Landau theory, the equilibrium value of 7(P, 7) is determined by mini- 
mising this expansion, i.e. by the condition 0®(?, T, n)/On = 0. The relation (144.2) 
is then, of course, satisfied: 


oO = (S 0D\ on (0® ee 
an = a) + (Gn) arm (ai) = a 
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We may note first of all that any field, however weak, has the result that 
the parameter 7 becomes different from zero at every temperature. Thus 
the field reduces the symmetry of the more symmetrical phase, and so the 
difference between the two phases disappears. Accordingly, the discrete 
phase transition point also disappears; the transition is “smoothed out”. 
In particular, instead of the sharp discontinuity of the specific heat, there is 
an anomaly spread over a range of temperatures. The order of magnitude 
of this range can be estimated from the condition nhV ~ atn?; taking 7 
from (143.6), we then find 

t~ WABIBY23/g, 

For a quantitative investigation of the transition, we write the equilib- 

rium condition (6®/6n),. , = 0: 
2atn + 4Br> = hV. (144.4) 


The dependence of 7 on the field # is different for temperatures above 
and below T,. It will be recalled that we agreed to take a > 0, so that the 
temperatures ¢ > 0(T > T,) correspond to the symmetrical phase (7 = 0 
for h = 0). 
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+ We are everywhere considering transitions for a given value of the pressure; 
the subscript P indicating that the pressure is constant in the differentiations is 
omitted for brevity. 
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For ¢ > 0, the left-hand side of equation (144.4) increases monotoni- 
cally with 7 (Fig. 63a). The equation therefore has for any given value of h 
just one (rea!) root, which vanishes when h = 0. The function 7(h) is single- 
valued, and the sign of 7 is the same as that of h (Fig. 64a). 

If ¢ < 0, however, the left-hand side of (144.4) is not a monotonic func- 
tion of 7 (Fig. 63b), and so the equation has three different real roots over a 
certain range of values of h, and n(h) is no longer single-valued (Fig. 64b). 
This range evidently has limits given by the condition 


Fe at +4Br?) = 2at + 12Br? = 0, 


and is —h, < h < h,, where 


_ (2\38 (at)? 
h, = (3) ae (144.5) 


It is easy to see, however, that the whole section of the curve BB’ on which 
(87/0h), < 0 corresponds to thermodynamically unstable states: differen- 
tiating equation (144.4) with respect to h, we find 


GB owe 


On? / 1,5 
hence we see that (6°/677),. , < 0 when (01/0h), < 0, ie. ® has a maxi- 
mum here, not a minimum. 

In the sections AB and A’B’, the thermodynamic potential is a minimum, 
but its value is greater than for the minima corresponding to the sections 
A’D’ and AD respectively, as is easily seen by direct calculation, although it 
is also obvious a priori: since the field h appears in ® as the term —yhV, it is 
certainly more favourable thermodynamically for the sign of 7 to be the 
same as that of h. Thus the sections AB and A’B’ correspond to metastable 
states of the body, and the true equilibrium form of the function 7(h) is 
given by the continuous curve DAA’D’ in Fig. 64b, all points of which 
correspond to thermodynamically stable states. If at a given temperature 
t < 0 the field is varied, a phase transition of the first kind occurs when it 
passes through h = 0: at this point phases with values of n = +(a|t|/2By'” 
opposite in sign are in equilibrium together. 

The susceptibility is determined as the derivative 


% = (0n/Oh)z;4--0- (144.7) 
Differentiation of (144.4) gives 
An/oh = V/(2at +12Br?), 
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and substitution (as h + 0) of 4? = 0 for t > 0 and 7? = —at/2B for t < 0 
leads to the result 


y= V/2at for t>0, y=V/—-4at for +<0. (144.8) 


The infinite value of x as ¢ -- 0 is a natural consequence of the fact (already 
mentioned at the end of § 143) that the minimum of ®(n) becomes steadily 
flatter as the transition point is approached. Because of this, even a slight 
perturbation has a great effect on the equilibrium value of 7. 

The quantity 


hy ~ (a|t|)°?/VBM 


gives the value of the field at which the field-induced parameter 744 ~ yh 
becomes of the same order of magnitude as the characteristic value of the 
spontaneous (zero-field) 7,, ~ (a|¢|/B)'”. Fields hh, are “weak” in the 
sense that they do not affect the thermodynamic quantities of the body in 
the first approximation. Fields h >> h, are “strong” fields for which the 
thermodynamic quantities have values determined by the field in the first 
approximation; evidently, when ¢ = 0, any field is strong in this sense. 
In strong fields, the order parameter is 


n = (hV/4BY'8. (144.9) 


It is also easily verified that in this limit the specific heat C, is independent 
of the field. 


§ 145. Change in symmetry in a phase transition of the second kind 


In the theory given in the preceding sections we have considered a phase 
transition of the second kind with some definite change in symmetry of the 
body, assuming a priori that such a transition is possible. Such a theory, 
however, does not say whether a given change of symmetry can in fact occur 
by a transition of the second kind. The theory developed in the present 
section is designed to answer this question; it starts from a different state- 
ment of the problem, whereby a certain symmetry of the body at the tran- 
Sition point itself is specified, and we ask what symmetry is possible on 
either side.of this point. 

For definiteness, we shall speak of phase transitions involving a change in 
structure of the crystal lattice, i.e. a change in the symmetry of the config- 
uration of atoms in it. Let e(x, y, z) be the density function (defined in § 128), 
which gives the probability distribution of various positions of the atoms 
in the crystal. The symmetry of the crystal lattice is the set or group of all 
transformations of the coordinates under which the function o(x, y, z) is 
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invariant. Here we mean, of course, the complete symmetry of the lattice, 
including rotations, reflections and also the infinite (discrete) set of all pos- 
sible parallel displacements (translations); that is, we are concerned with 
one of the 230 space groups. 

Let Go be the symmetry group of the crystal at the transition point itself. 
As we know from group theory, an arbitrary function (x, y, z) can be rep- 
resented as a linear combination of several functions ¢1, de, ... having the 
property of being transformed into combinations of one another by all the 
transformations in the group concerned. In general the number of these 
functions is equal to the number of elements in the group, but when the 
function ¢ itself has a certain symmetry the functions ¢, may be fewer in 
number. 

Bearing this in mind, we write the density function p(x, y, z) of the crystal 


as the sum 
= dmb: 


where the functions ¢, are transformed into combinations of one another 
by all transformations in the group Go. The matrices of these transformations 
form a representation of the group Go. The choice of the functions ¢, is not 
unique; they can obviously be replaced by any linear combinations of them- 
selves. The functions ¢, can always be so chosen as to form a number of 
independent sets containing the minimum number of functions, the functions 
in each set being transformed only into combinations of one another by all 
transformations in the group Go. The matrices of the transformations of the 
functions in each of these sets form irreducible representations of the group 
Go, and the functions themselves are the basis of these representations. Thus 


we can write 
e= Lyne, (145.1) 


n being the number of the irreducible representation and / the number of the 
function in its basis. In what follows we shall assume the functions ¢™ to be 
normalised in some definite manner. 

The functions ¢(” always include one which is invariant under all the 
transformations in the group Gp and gives what is called the unit representa- 
tion of the group. Thus this function (which we denote by gp) has the 
symmetry of Go. Denoting the remaining part of @ by de, we can write 


@=a+é6e, de=y ynyrgys (145.2) 
where now the unit representation is excluded from the summation (as 


indicated by the prime to the summation sign). The function 6g has a Jower 
symmetry than that of Go, since 6g may also remain invariant under some 
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transformations in this group but certainly does not do so under all. We may 
note that the symmetry G of the function @ (which clearly is the symmetry 
of de) has, strictly speaking, been assumed from the start to be lower than 
Go, since otherwise the sum (145.1) would include only one term, the 
function @ itself, which gives the unit representation.' 

Since the physical quantity dg must be real and must remain real under 
all transformations, we must assume physically irreducible representations 
whose base functions can be taken as real (§ 135); accordingly the functions 
@ are taken to be real. 

The thermodynamic potential ® of a crystal whose density function ¢ is 
given by (145.2) is a function of temperature, pressure and the coefficients 
n (and depends, of course, on the specific form of the functions ¢” 
themselves). The actual values of the n as functions of P and T are deter- 
mined thermodynamically from the conditions of equilibrium, i.e. the con- 
ditions for ® to be a minimum. This determines also the symmetry G of the 
crystal, since it is clear that the symmetry of the function (145.2), with func- 
tions ¢” whose laws of transformation are known, is determined by the 
values of the coefficients in the linear combination of the 6”. 

If the crystal is to have the symmetry Gp at the transition point itself, it is 
necessary that all the 7” should be zero there, i.e. 69 = 0, 0 = 05 Since 
the change in state of the crystal in a phase transition of the second kind is 
continuous, 6g must tend continuously to zero at the transition point, not 
discontinuously, i.e. the coefficients 1 must tend to zero through arbi- 
trarily small values near the transition point. Accordingly, we can expand 
the potential ®(P, T, 7) in powers of the 7” near the transition point. 

First of all let us note that, since the functions $” (belonging to the 
basis of each irreducible representation) are transformed into combinations 
of one another by the transformations in the group Go, these transformations 
can be regarded as transforming (in the same manner) the coefficients 
nf” instead of the functions ¢™. Next, since the thermodynamic potential 
of the body must obviously be independent of the choice of coordinates, 
it must be invariant under any transformation of the coordinate system, 
and in particular under the transformations of the group Gp. Thus the ex- 
pansion of ® in powers of the 1 can contain in each term only an invariant 
combination of the 7” that is of the appropriate power. 

No linear invariant can be formed from quantities which are transformed 
according to a (non-unit) irreducible representation of a group, for other- 


t For magnetic transitions, the density o(x, y, z) would have to be replaced by 
the current density j(x, y, z) in the body. In the paramagnetic phase j = 0; beyond 
the transition point j = 0j is small. 
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wise that representation would contain the unit representation, i.e. would be 
reducible. Only one second-order invariant exists for each representation: 
a positive-definite quadratic form in the 7”, which can always be 
reduced to a sum of squares. 

Thus the leading terms in the expansion of ® are of the form 


© = D+" Amy nm], (145.3) 


where the A™ are functions of P and T. 

At the transition point itself, the crystal must have the symmetry Go, i.e. 
the equilibrium values of the 1 must be zero. It is evident that ® can have 
a minimum when every 7 = 0 only if all the A™ are non-negative. 

If all the A™ were positive at the transition point, they would also be 
positive near that point, so that the 7 would remain zero and there would 
be no change of symmetry. For some 7” to be non-zero, one of the coeffi- 
cients A™ must change sign, and this coefficient must therefore vanish at 
the transition point.’ (Two coefficients A™ can vanish simultaneously only 
at an isolated point in the PT-plane, which is the intersection of more than 
one line of transitions of the second kind.) 

Thus on one side of the transition point all the A > 0, and on the other 
side one of the coefficients A is negative. Accordingly, all the (” are always 
zero on one side of the transition point, and on the other side non-zero {” 
appear. We conclude, therefore, that on one side of the transition point the 
crystal has the higher symmetry Go, which is retained at the transition point 


t Strictly speaking, this condition should be more accurately stated as follows. 
The coefficients A™ depend, of course, on the particular form of the functions 
@™, being quadratic functionals of these which depend on P and T as parameters. 
On one side of the transition point, all these functionals AM{6\; P, T} are posi- 
tive-definite. The transition point is defined as that at which (as Por T varies 
gradually) one of the A™ can vanish: 


AUP: P, TY > 0. 


This vanishing corresponds to a definite set of functions ¢/, which may in 
principle be determined by solving the appropriate variational problem. These 
will also be the functions @!” which determine the change dp at the transition 
point. Substituting these functions in A™{o\”; P, T}, we obtain just the function 
AP, T) which satisfies the condition of vanishing at the transition point. The 
functions ¢\” may then be regarded as given, as will be assumed below; the allow- 
ance for the variation of the @\” with P and T would lead to correction terms 
of higher order than those of interest here. 
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itself, while on the other side of the transition point the symmetry is lower, 
and so the group G is a sub-group of the group Gp. 

The change in sign of one of the A™ causes the appearance of non-zero 
ny” belonging to the nth representation. Thus the crystal with symmetry Gy 
becomes one with density @ = oo +d, where 


de= pm from (145.4) 


is a linear combination of the base functions of any one of the irreducible 
representations of the group Go (other than the unit representation). Accord- 
ingly we shall henceforward omit the index n which gives the number of 
the representation, meaning always the one which corresponds to the tran- 
sition considered. 

We shall use the notation 


7 = yt» m= NM (145.5) 


(so that Sy? = 1) and write the expansion of © as 
D = O(P,T) +7 AP, T) +17 3) CAP, T) LPM) 
+S BAP, TIS D+ -- + (145.6) 


where f®, f, ... are invariants of the third, fourth, etc., orders formed 
from the quantities y,; in the sums over « there are as many terms as there 
are independent invariants of the appropriate order which can be formed 
from the y,. In this expansion of the thermodynamic potential, the coefficient 
A must vanish at the transition point. In order that the transition point itself 
should be a stable state (i.e. in order that ® should have a minimum at that 
point when 7, = 0), the third-order terms must vanish and the fourth-order 
terms must be positive-definite. As has been mentioned in § 143, a line of 
phase transitions of the second kind (in the PT-plane) can exist only if the 
third-order terms in the expansion of ® vanish identically. This condition 
may now be formulated as requiring that it should be impossible to con- 
struct from the 7, third-order invariants which are transformed according 
to the corresponding irreducible representation of the group Go.‘ 


t In the language of the theory of representations, this signifies that the symmet- 
ric cube [I] of the representation I in question must not contain the unit repre- 
sentation. For the (literally) irreducible representations of the space groups, there 
cannot be more than one third-order invariant (the proof is given by M. S. Shur, 
Soviet Physics JETP 24, 845, 1967). When two representations are combined as 
one physically irreducible representation, two third-order invariants may occur. 
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Assuming this condition to be satisfied, we write the expansion as far as 
the fourth-order terms in the form 


@ = D+ A(P, TIP +1 BP, T) fC). (145.7) 


Since the second-order term does not involve the y,, the latter are deter- 
mined simply from the condition for a minimum of the fourth-order terms, 
i.e. of the coefficient of 7! in (145.7).' Denoting the minimum value of this 
coefficient simply by B(P, T) (which must be positive, as shown above), we 
return to the expansion of @ in the form (143.3), 1 being determined from the 
condition that © is a minimum regarded as a function of 7 alone, as in§ 143. 
The values of the y, thus found determine the symmetry of the function 


de = 7 oe Mi, (145.8) 


i.e. the symmetry G of the crystal which is formed in the transition of the 
second kind from a crystal of symmetry Go.t 

In this formalism, the set of quantities 1, acts asan order parameter de- 
scribing the difference between the unsymmetrical and symmetrical phases. 
We see that in general this parameter has several components; the ratios 
y, = ,/n determine the symmetry of the unsymmetrical phase, and the 
common factor 7 gives a quantitative measure of the deviation from the 
specified symmetry. 

The conditions derived above, however, are not yet sufficient to ensure 
the possibility of a phase transition of the second kind. A further essential 
condition is obtained if we consider a fact (hitherto deliberately ignored) 
relating to the classification properties of representations of space groups.$ 


t It may happen than there is only one fourth-order invariant, (2p)? = rf. 
In this case, the fourth-order term is independent of the y,, and higher-order terms 
must be used to determine the y,; these terms depend on the y, The use of 
higher-order terms may also be necessary when the minimisation of the fourth- 
order terms depending on the y; reduces them to zero. 

t In § 143 we have considered a transition with a given change of symmetry. 
Using the concepts defined here, we can say that the quantities y, were assumed 
to have given values (so that the function dg had a given symmetry). With the 
problem stated in these terms, the absence of the third-order term (in the expan- 
sion (143.3)) could not be a sufficient condition for the existence of a line of tran- 
sition points of the second kind, since it does not exclude the possibility that there 
are third-order terms in the general expansion with respect to several 1; (if the 
irreducible representation is not one-dimensional). For example, if there are three 
m, and the product 7,275 is invariant, the expansion of ® contains a third-order 
term, whereas this term will vanish if the function dg has a certain symmetry 
which requires that one or two ; should be zero. 

§ The results and examples given below are due to E.M. Lifshitz (1941). 
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We have seen in § 134 that these representations can be classified not only 
by a discrete parameter (such as the number of the small representation) 
but also by the parameter k, which takes a continuous series of values. The 
coefficients A™ in the expansion (145.3) must therefore depend not only on 
the discrete number 7 but also on the continuous variable k. 

Let a phase transition correspond to the vanishing (as a function of P and 
T) of the coefficient A™(k) with a given number n and a given k = ko. In 
order that the transition should actually occur, it is necessary that A as a 
function of k should have a minimum for k = Ko (and therefore for all vec- 
tors of the star of ko), i.e. the expansion of A™(k) in powers of k—ko about 
ky should contain no linear terms. Otherwise, some coefficients A™(k) 
necessarily vanish before A™(ko) and a transition of the type in question 
cannot occur. A convenient formulation of this condition can be obtained 
on the basis of the following arguments. 

The value of k, determines the translational symmetry of the functions ¢,, 
and therefore that of the function 6¢ (145.8), i.e. it determines the periodici- 
ty of the lattice of the new phase. This structure must be stable in compari- 
son with those which correspond to values of k close to ko. But a structure 
with k = ko-+ (where x is small) differs from that with k = Ko by a spatial 
“modulation” in the periodicity of the latter, that-is, by the appearance of 
non-uniformity over distances (~ 1/x) which are large compared with the 
periods (cell dimensions) of the lattice. Such non-uniformity can be macro- 
scopically described by regarding the parameters 7, as slowly varying func- 
tions of the coordinates (whereas the functions @, oscillate over interatomic 
distances). Thus we obtain the requirement that the state of the crystal 
should be stable with respect to loss of macroscopic homogeneity." 

When the quantities 7, are not constant in space, the thermodynamic po- 
tential per unit volume of the crystal will depend not only on the 7, but also 
on their derivatives with respect to the coordinates (in the first approxima- 
tion, on the first derivatives). Accordingly ® (for unit volume) must be 
expanded in powers of the y, and of their gradients Vn, near the transition 
point. If the thermodynamic potential (of the whole crystal) isto be a mini- 
mum for constant 7, it is necessary that the first-order terms in the gradients 
in this expansion should vanish identically. (The terms quadratic in the 


t It must be emphasised, however, that these arguments assume transitions in 
which the symmetry of the less symmetrical phase is the same along the whole 
line of transition points, i.e. the value of kg is independent of temperature. As well 
as this class of phase transitions (which alone is covered by the discussion in this 
section), there can also be transitions in which kp is dependent on temperature 
and so the periodicity of the less symmetrical phase varies along the line of tran- 
sition points. Such transitions will be considered elsewhere (see Electrodynamics) 
in connection with magnetic phase transitions. 
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derivatives must be positive-definite, but this imposes no restriction on the 
7,» since such a quadratic form exists for n, which are transformed by any of 
the irreducible representations. ) 

Among the terms linear in the derivatives, the only ones that can beof 
interest are those simply proportional to 07,/O0x, ..., and those containing 
the products 7,0n,/Ox, .... The higher-order terms are clearly of no im- 
portance. The thermodynamic potential of the whole crystal, i.e. the inte- 
gral ( ® dV over the whole volume, is to be a minimum. The integration of all 
the total derivatives in ® gives a constant which does not affect the deter- 
mination of the minimum of the integral. We can therefore omit all terms 
in ® which are simply proportional to derivatives of the n,. Among the terms 
containing products 1, 0n,/Ox, ... we can omit all symmetrical combina- 
tions 7, 0n,/Ox +n, On,/dx = 8(nn,)/Ox, ..., leaving the antisymmetrical 
parts 


Tk My (145.9) 


The expansion of ® can contain only invariant linear combinations of the 
quantities (145.9). Hence the condition for a phase transition to be possible 
is that such invariants do not appear. 

The components of the gradients Vn, are transformed as the products of 
the components of a vector and the quantities 7,. The differences (145.9) are 
therefore transformed as the products of the components of a vector and the 
antisymmetrised products of the quantities n,. Consequently the require- 
ment that no linear scalar can be formed from the quantities (145.9) is 
equivalent to the requirement that no combinations which transform as the 
components of a vector can be formed from the antisymmetrised products 


Lie = PiPe— PuPi 3 (145.10) 


here the ¢, and ¢; are the same base functions of the relevant irreducible 
representation, which we regard as taken at two different points x, y, z and 
x’, y’, z in order that the difference shall not be identically zero.* Labelling 
the base functions by the two suffixes k« (as in § 134), we write the difference 
(145.10) in the form 


Xia, kB Py Pwp Pie Prrps (145.11) 
where k, k’, ... are vectors of the same star. 
+ In the language of the theory of representations, this means that the antisym- 


metric square {J} of the representation J" in question must not contain the irre- 
ducible representations by which the components of a vector are transformed. 
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Let the vector k occupy the most general position and have no proper 
symmetry. The star of k contains n vectors according to the number of rota- 
tional elements in the group (or 2n if the space group itself does not include 
inversion), each k being accompanied by the different vector —k. The cor- 
responding irreducible representation is given by the same number of func- 
tions ¢,, (one for each k, and so we omit the suffix «). The quantities 


Ia, —e = PrP 4 — PP (145.12) 


are invariant under translations. Under the rotational elements, these n (or 
2n) quantities are transformed into combinations of one another, giving a 
representation of the corresponding point group (crystal class) with dimen- 
sion equal to the order of the group. But this representation (called a regular 
representation) contains all the irreducible representations of the group, 
including those by which the components of a vector are transformed. 

Similar considerations show that it is possible to form a vector from the 
quantities ¥,,, yg in cases where the group of the vector k contains one axis 
and planes of symmetry passing through that axis. 

This discussion becomes inapplicable, however, if the group of the vector 
k contains axes which intersect one another or intersect planes of symmetry, 
or contains inversion; such groups will be said to have a central point. 
In such cases the question of constructing a vector from the quantities 
(145.11) requires separate treatment in each particular case. In particular, 
such a vector certainly can not be constructed if the group of k contains 
inversion, so that k and —k are equivalent, and only one function ¢, 
corresponds to each k in the star; in this case there are no 7%, invariant 
under translations (as the components of a vector must necessarily be). 

Thus the requirement formulated above greatly restricts the possible chan- 
ges of symmetry in a phase transition of the second kind. Of the infinity of 
different irreducible representations of the group Go, we need consider only a 
comparatively small number for which the group of the vector k has a central 
point. 

A proper symmetry of this kind can, of course, occur only for vectors 
k which occupy certain exceptional positions in the reciprocal lattice, their 
components being equal to certain simple fractions (3. i, +) of the basic 
vectors of that lattice. This means that the change in the translational sym- 
metry of the crystal (i.e. in its Bravais lattice) in a phase transition of the 
second kind must consist in an increase by a small factor in some of the 
basic lattice vectors. Investigation shows that in the majority of cases the 
only possible change in the Bravais lattice is a doubling of the lattice vectors. 
In addition, in body-centred orthorhombic, tetragonal and cubic and face- 
centred cubic lattices some lattice vectors can be quadrupled, and in a hex- 
agonal lattice tripled. The volume of the unit cell can be increased by a factor 
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of 2, 4 or 8, in a face-centred cubic lattice also by 16 or 32, and in a hexagonal 
lattice by 3 or 6. 

Transitions are, of course, also possible without change of Bravais lattice 
(corresponding to irreducible representations with k = 0). The change in 
symmetry then consists in a decrease in the number of rotational elements, 
i.e. a change in the crystal class. 

We may note the following general theorem. A phase transition of the 
second kind can occur for any change in structure which halves the number 
of symmetry transformations; such a change may occur either by a doubling 
of the unit cell for a given crystal class or by a halving of the number of 
rotations and reflections for a given unit cell. The proof is based on the fact 
that, if the group Gp has a sub-group G of half the order, then the irreducible 
representations of Go always include a one-dimensional representation given 
by a function which is invariant under all transformations of the sub-group G 
and changes sign under all the remaining transformations of the group Go. 
It is clear that in this case there are no odd-order invariants, and no quanti- 
ties of the type (145.11) can be formed from one function. 

The following theorem also appears to be valid. Phase transitions of the 
second kind cannot occur for changes in structure which reduce to one-third 
the number of symmetry transformations, owing to the presence of third- 
order terms in the expansion of ®. 

Finally, to illustrate the practical applications of the general theory given 
above, let us consider the occurrence of ordering in alloys which, in the dis- 
ordered state, have a body-centred cubic lattice with atoms at the vertices 
and centres of cubic cells, as in Fig. 61b (§ 142).' The problem is to determine 
the possible types of ordering (called superlattices in crystallography) which 
can appear in such a lattice in a phase transition of the second kind. 

For a body-centred cubic lattice, the reciprocal lattice is face-centred cubic, 
If the edge of the body-centred cubic lattice cell is taken as the unit of length, 
the edge length of the cubic cell in the reciprocal lattice is 2-27, and in this 
lattice the following vectors k have proper symmetry groups with a central 
point: 


(a) (000) On | 

(b) ($44 On 

@ ($44), (-4 -4 -4) Tz $ (145.13) 

(@ (044, (404), 440), | 
(04-4), (-404), — (F -40) Da) 


+ Such a lattice belongs to the symmorphic space group O%. 


§ 145 Change in Symmetry 469 


These symbols show the components of the vectors k along the edges of 
the cubic reciprocal lattice cell (x, y, z axes) as fractions of the edge lengths. 
In order to obtain the vectors k in the units specified above, these numbers 
must be multiplied by 2-27 = 47. In (145.13) only non-equivalent vectors 
are shown, i.e. the vectors of each star. 

The subsequent discussion is greatly simplified by the fact that not all small 
representations need be considered in solving the problem in question. The 
reason is that we are concerned only with the possible changes of symmetry 
that can occur by the formation of a superlattice, that is, by an ordered 
arrangement of atoms at existing lattice sites without relative displacement. 
In this case the unit cell of the disordered lattice contains only one atom. 
Hence the appearance of the superlattice can only mean that the lattice 
points in different cells become non-equivalent. The change 60 in the 
density distribution function must therefore be invariant under all rotational 
transformations of the group of k (without simultaneous translation). Thus 
only the unit small representation is admissible, and accordingly u, may be 
replaced by unity in the base functions (134.3). 

Let us now consider in turn the stars listed in (145.13). 

(a) The function with k = 0 has complete translational invariance, i.e. in 
this case the unit cell is unchanged, and since each cell contains only one 
atom no change of symmetry can occur. 

(b) The function e”""*+”+2) corresponds to this k. The linear combination 
(of this function and the functions obtained from it by all rotations and 
reflections) which has the symmetry O, of the group of k is 


= cos 27x cos 2zy cos 222. _ (145.14) 


The symmetry of the phase formed is that of the density function 9 = @9+69, 
do = ng.' The function ¢ is invariant under all transformations of the class 
O,, and under translations along any edge of the cubic cell, but not under a 
translation through half the space diagonal, (+ + 4). Hence the ordered phase 
has a simple cubic Bravais lattice with two non-equivalent points in the unit 
cell, (000) and (3 3 4); these will be occupied by different atoms. The alloys 
which can be completely ordered in this way have the composition AB (e.g. 
the alloy CuZn mentioned in § 142). 


(c) The functions corresponding to these k which have the symmetry T,, are 


1 = COS HX COS TY COs NZ, cee 
de = Sin zx sin zy sin 7. so) 


t This does not mean, of course, that the change 6p in an actual crystal is given 
by the function (145.14). Only the symmetry of the expression (145.14) is important. 
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From these we can form two fourth-order invariants: (pj +93)" and (@{ +3). 
The expansion of © (145.7) therefore has the form 

B= Dot Ar? + By + Bafoit+y). (145.16) 
Here two cases must be distinguished. Let Bz < 0; then® as a function of 1 
and ys, with the added condition 77 +73 = 1, has a minimum for y, = 1, 
Yq = 0. The function de = yd, has the symmetry of the class O, with a face- 
centred Bravais lattice, whose cubic cell has a volume 8 times that of the 
original cubic lattice cell. The unit cell contains 4 atoms; the cubic cell, 
16 atoms. By placing like atoms at equivalent lattice sites we find that this 
superlattice corresponds to a ternary alloy of composition ABC: with atoms 
in the following Eee 


4A (000), (0$5) & cyclic, 

4B ($43), (004 & cyclic, 

8C (4 4 , + t ’ ( 7. 4) & cyclic, (F 4 +) & cyclic; 
here the coordinates of the atoms are given in units of the edge length of the 
new cubic lattice cell, which is twice that of the original cell (see Fig. 65a); 


“& cyclic” denotes cyclic interchange. If the B and C atoms are identical 
we obtain an ordered lattice of composition ABs. 


Now let B, > 0. Then @ has a minimum at yj = y3 = 4, so that 69 = 
161 +2)/+/2 (or (¢1— $2) /*/2, which leads to the same result).' This func- 
tion has the symmetry of the class O, with the same face-centred Bravais 
lattice as in the preceding case but only two sets of equivalent points, which 
can be occupied by atoms of two kinds A and B: 


8A (000), AGt 233) & cyclic, (O44) & cyclic, 
B GED. GED. EY Sadiq (004) & ert 
(see Fig. 65b). 


t The fact that y, and y, are in each case simply numbers is due to the presence 
in ® of only one term (which depends on y, and y,). When there are more fourth- 
order invariants, the sets of y; that minimise ® may include some that depend 
on P and T. 
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(d) The following functions with the required symmetry Du, as aa to 
these vectors k: 


$1 = COS my—Z), Ps = COS m(x—Y), os = COs a(x —2Z), 
ga = cosm(y+z), $e=cosn(x+y), be = cosx(x +z). 


From these we can form one third-order invariant and four fourth-order 
invariants, and so the expansion (145.6) becomes 


D = Do + An? +Cr(yrysys + yoysye + Y1V4Ve + Y2v4ys) 
+ By t+ BaAt+tB+ut n+ 
+ Bo(yivi + vave + V576) 
+ Bar (yryeysyat ysvaysye + ¥1727876)- 


Because cubic terms are present, a phase transition of the second kind is 
impossible in this case. To examine whether isolated points of continuous 
transition can exist and the properties of such points (see § 150) it would be 
necessary to investigate the behaviour of the function ® near its minimum; 
we shall not pause to do so here. 

The above example shows what rigid limitations are imposed by the ther- 
modynamic theory on the possibility of phase transitions of the second kind; 
for example, in this case they can exist only when superlattices of three types 
are formed. | 

The following fact may also be pointed out. In case (c), when Bz < 0, the 
actual change in the density function, 6g = 7¢1, corresponds to only one of 
the two parameters 71, y2 which appear in the thermodynamic potential 
(145.16). This illustrates an important feature of the foregoing theory: in 
considering a particular change in the lattice in a phase transition of the 
second kind, it may be necessary to take account of other, “virtually pos- 
sible”, changes. 
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It has already been mentioned several times that the actual point of a phase 
transition of the second kind is a singularity of the thermodynamic functions 
for the body. The physical nature of this singularity consists in an anomalous 
increase in the fluctuations of the order parameter, which in turn is due to 
the already described flatness of the thermodynamic potential minimum 
near the transition point. The form of this increase in the Landau model is 
easily found. We shall assume that the change of symmetry in the transition 
- is described by only one parameter 7. 
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The minimum work needed to bring the system out of equilibrium for 
given constant values of the pressure and temperature is equal to the change 
A®, in the thermodynamic potential.t The fluctuation probability for con- 
stant P and T is therefore 


w o exp (—4@,/T). ' (146.1) 


In this section the equilibrium value of 7 will be denoted by 7. For a small 
departure from equilibrium, 


A®, = 3(n—7)? (0°D,/8n?)p, r. 


By means of (144.6) we can express the derivative 0?®,/d7? in terms of the 
susceptibility of the substance in a weak field, according to the definition 
(144.7). Then the fluctuation probability (at temperatures near the transition 
point T,) is 

w cc exp [—(n—)?V/2yT 


Hence the mean square fluctuation is 
(Any) = Tex/V. (146.2) 


This expression can also be derived directly from the fluctuation-dissi- 
pation theorem. To do so, we need only note that, if the field h is identified 
with the external interaction f (with frequency w = 0) which appears in the 
formulation of the theorem (§ 124), then the corresponding quantity x is 
AnV, and the generalised susceptibility «(0) is the product yV. Formula 
(146.2) follows from (124.14). 

According to (144.8), the mean square (146.2) increases as 1/t when T ~ T,. 
For a fuller elucidation of the nature and significance of this divergence, 
let us determine the spatial correlation function of the fluctuations of the 
order parameter. We shall be concerned with long-wavelength fluctuations in 
which the fluctuating quantity varies slowly through the body; it is these 
fluctuations which, as we shall see later, increase anomalously near the tran- 
Sition point. 

For an inhomogeneous body (such as it becomes when the inhomogeneous 
fluctuations are taken into account), the thermodynamic potential would 
have to be expressed as the integral ©, = {® dV of the potential density, 
a function of the coordinates of a point in the body. However, in the de- 
scription of the thermodynamic state by the potential ®, the number N of 
particles in the body is given, not its volume (which depends on P and T). 


t In this section the thermodynamic potential (®, and later £2) for the body as a 
whole is denoted by the suffix t; letters without suffixes are used for the potential 
values per unit volume. 
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It is therefore useful to change to a description by another potential which 
relates to some chosen and specified volume V in the medium, containing 
a variable number N of particles. This can be Q(T, u), a function of tem- 
perature and chemical potential u (for given V); the variable P is here re- 
placed by wu, which has analogous properties; like P, it is constant throughout 
a system in equilibrium. 

Near the transition point, the 7-dependent terms in the expansion of the 
function ®(P, T, n) (144.3) constitute a small increment to ®o(P, 7), and 
after n has been determined by minimisation the remaining terms are all of 
the same order of magnitude. According to the theorem of small increments, 
we can therefore immediately write down a similar expansion for the poten- 
tial Q(u, T, y): 


2(u, T, n) = Qo(u, T) + utr? + byt —nh, (146.3) 


with the same coefficients but expressed in terms of a different variable, 
lt instead of T; the potential 2 is here taken per unit volume, and so the 
coefficients are « = a/V,b = B/V.* 

The expansion (146.3) relates to a homogeneous medium. In an inhomo- 
geneous body, it includes not only different powers of 7 itself but also deriv- 
atives of various orders with respect to the coordinates. For long-wave- 
length fluctuations, we need take only the terms in the expansion that con- 
tain derivatives of the lowest order (and the lowest powers of these). The 
terms linear in the first-order derivatives, of the form f(7) 0n/0x,, give in- 
tegrals over the surface on integration through the volume, which represent 
a surface effect unimportant here.t The same is true of terms proportional 
to 0n/Ox, Ox,. Thus the first terms to be taken into account in the expansion 
of 22 in derivatives are those proportional ton 62n/8x, 0x, or (On/8x;) (On/Ox;)- 
The former reduce to the latter on integration over the volume. We find 
ultimately that the function Q written above has to be supplemented by terms 
of the form 


on On , | 
Sills ox, Ox,’ 89.4) 


as always, summation over repeated vector suffixes is implied. We shall take 
only the simplest case (corresponding to cubic symmetry with 7 = 0), when 


t Here, however, it has to be remembered that the expansion of the coefficient 
A = at must now be made in powers of the difference t = T—T,(u), not T—T(P); 
in this sense, the value of the coefficient « = a/V changes. 

1 There are no first-order terms in the first derivatives in the expansion of 92, 
even when the transition is described by several order parameters. In such cases, 
the proposition can be verified only by using also the conditions for the stability 
of the body at the transition point (§ 145). 
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8x = 20%; this case already shows all the characteristic features of the 
correlation function. Thus we write the thermodynamic potential density as 


Q = Qo+utn + byt +8(6n/Or)>—nh. (146.5) 


It is evident that, for stability of a homogeneous body, g => 0; otherwise, 
2, cannot have a minimum when 7 = constant. 

Considering fluctuations for given u and T, we must write their proba- 
bility as 


w oc exp (—42,/T), 


since the minimum work needed in these conditions to bring the system out 
of equilibrium is R,;, = —42,. Specifying the value of 7 in a chosen volume 
V does not prevent the exchange of particles (or energy) between this volume 
and the surrounding “medium”. We can therefore consider the fluctuations 
of 7 for constant u (and T); cf. the beginning of § 115. 

Let us take the particular case of fluctuations in the symmetrical phase 
(in the absence of a field h); then 7 = 0, so that 4n = 7. As far as the second- 
order terms, the change in the potential Q, ist 


AQ, = § [at(An)? + 2(8 An/ory*) dV. (146.6) 


We now proceed as in § 116, expanding the fluctuation 47(r) as a Fourier 
series in the volume V: 


An = x Amem", An = Ant ; (146.7) 


the gradient is 


0 An/or = 2 ik Ane™** . 


On substitution of these expressions in (146.6), the integration over the vol- 
ume causes all terms to vanish except those which contain the products 
An,An_y, = |An,|?. The result is 


AQ, = VS (gh? +a) |Aml, 


and hence 
(|Amel?) = T/2V (gk? +a); (146.8) 


The theory of fluctuations based on an expression of this type was first devel- 
oped by L.S. Ornstein and F. Zernicke (1917) for fluctuations near the critical 
point. 
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cf. the derivation of (116.12) from (116.10). We see that as t - 0 it is in fact 
the long-wavelength fluctuations with k ~ +/(at/g) which increase.* It must 
be emphasised that the formula (146.8) itself is valid only for sufficiently 
long wavelengths 1/k, which must always be large compared with interatomic 
distances. . 

The required correlation function will be denoted by 


G(r) = (An(ts) 4n(ta)), = t1—12, (146.9) 
and is calculated as the sum 


G(r) = 2 Amn l?e%* 


or, changing to integration in k-space, 


G(r) = f (An |Dek"V Bk /(2n)?. (146.10) 


Using the Fourier transformation formula given in the footnote to § 117, 
Problem 3, we find (for r # 0) 


G(r) = (T./8agr) exp (—7/r.), (146.11) 


where 
Te = /(g/at) (146.12) 


is called the correlation radius of the fluctuations; it determines the order 
of magnitude of distances at which the correlation decreases significantly. 
As the transition point is approached, the correlation radius increases as 
1//t, and at the transition point itself the correlation function decreases as 
1/r. 

When r = 0, the integral (146.10) determines the mean square fluctuation 
of the parameter 7 in an infinitesimal volume element; it diverges for large k. 
This divergence, however, is simply due to the invalidity, in this region, of 
the expression (146.8) (which relates to long-wavelength fluctuations), and 
signifies only that ((47)*) contains a term independent of t. 

It must be emphasised, to avoid misunderstanding, that the previous 
expression (146.2) determines the fluctuations of the parameter 1 averaged 


t Analogous results are, of course, obtained on the other side of the transition 
point, in the unsymmetrical phase. Here 7] = (—at/2b)¥*, and the change in the 
potential 2, (again as far as quantities ~ (417)*) is 


AQ, = { [—2at(AnP+e(8 An/dr)*) dV, 


instead of (146.6). It is clear that the results obtained for (| 4m, |") (and for the corre- 
lation function below) will differ from the above only in that at is replaced by 
2a | tI. 
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over a volume V whose linear dimension / >> r,; this quantity may be denot- 
ed by (47)*),. The mean value of the function 47(r) over the volume V is 
just the Fourier component 47, _ 93 it is therefore natural that the expres- 
sion (146.8) for k = 0 coincides with (146.2). The latter can also be derived 
from the correlation function by the obvious formula 


(Amv = x | (Antes) dred) a a 
= > | G(r) dV, 3 (146.13) 


which is valid for any finite volume V. We may note that at the point t = 0 
itself (where G oc 1/r) this integral is proportional to 1//, where / is the linear 
dimension of the region in which the fluctuations are considered. The mean 
square ((47)*) , depends on the shape of the region as well as on its volume. 

We can now formulate the condition that determines the validity of this 
theory of fluctuations based on the expansion (146.5). This condition must 
be that the mean square fluctuation of the parameter n averaged over the 
correlation volume is small compared with the characteristic value 7? ~ 
&|[t|/b. This quantity is given by (146.2) when V ~ 7°, and we arrive at the 


condition 
T.y/r? « «|t|/b, (146.14) 


or (with y and r, from (144.8) and (146.12)) 
a {t| >> T2b?/g3 (146.15) 


(A. P. Levanyuk, 1959; V. L. Ginzburg, 1960)." 

The determination of the temperature dependences in the formulae derived 
above has also required expansions in powers of t = T—T, (in the coefii- 
cients of the expansion with respect to 7). The admissibility of such an 
expansion implies that the condition t « T, is satisfied; for it to be com- 
patible with (146.16), we must certainly have 


Tb? ag? <1. (146.16) 


The conditions (146.14)-(146.16) ensure that the fluctuations are suffi- 
ciently small, and also that the whole Landau theory of phase transitions 
given in the preceding sections is applicable. We see that a temperature range 
in which this theory is valid exists only if the inequality (146.16) is satisfied. 
In such cases, the conclusions of the theory remain valid as regards the 


t This condition can also be verified by a direct calculation of the fluctuation 
correction to the specific heat of the body near the transition point; see § 147, 
Problem. 
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selection rules for the possible changes of symmetry in transitions.‘ As 
regards the temperature dependence of the thermodynamic quantities, 
however, there is necessarily a narrow range of temperatures near T,, where 
the Landau theory is inapplicable. The conclusions of this theory must there- 
fore refer only to states of the two phases outside that temperature range. 
For example, the expressions derived in § 143 for the discontinuities of the 
thermodynamic quantities are to be regarded as the differences of their 
values at the two ends of the range. We shall call the immediate neighbour- 
hood of the point T,,, corresponding to the opposite of the inequality (146.15), 
the fluctuation range; in it, the fluctuations play the dominant role. 

In the above calculations, no account has been taken of the elastic proper- 
ties that distinguish a solid from a liquid,t or the deformation of a body in 
consequence of ordering (which we shall call striction). Within the Landau 
theory, these effects do not alter the conclusions reached in the preceding 
sections. The combined action of the two effects may, however, considerably 
alter the fluctuations of the order parameter, and therefore the nature of the 
phase transition. The investigation of this problem demands an extensive 
use of elasticity theory, and is therefore outside the scope of the present 
volume. Here, we shall simply state some results. 

The striction deformation may be either linear or quadratic in the order 
parameter, depending on the symmetry of the crystal. These two cases show a 
different influence of the elastic properties of the body on the phase transition. 

For linear striction, let y denote the order of magnitude proportionality 
coefficients between the components u, of the deformation tensor and the 
order parameter: u, ~ yn. The influence of this effect on the fluctu- 
ations appears in the neighbourhood of the transition point for which 
at S y*/A, where A is the order of magnitude of the elastic moduli of the 
body. In many cases, striction is a weak effect, and in this sense y is a small 
quantity. Then the temperature range mentioned is narrow and lies within 
the fluctuation range. 

The long-wavelength fluctuations (k S$ »/(72/Ag)) are then suppressed, and 
the correlation radius does not increase beyond values r, ~ /(gA/y”). The 
specific heat therefore has only a finite discontinuity at the transition point, as 
in the Landau theory.$ 


+ For transitions described by more than one order parameter, the establishment 
of all the conditions for the Landau theory to be valid requires more detailed 
analysis. 


+ What is important here is not so much the actual anisotropy of these properties 
but rather the fact that the deformations cannot be reduced to a single hydrostatic 
compression. In this sense, the following discussion would also apply to an iso- 
tropic solid with a non-zero shear modulus. 

§ See A. P. Levanyuk and A. A. Sobyanin, JETP Letters 11, 371, 1970. 
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Quadratic striction leads to different results.' This effect likewise suppresses 
the fluctuations, but to a smaller extent. Whereas the specific heat would 
become infinite at the transition point without allowance for striction (see 
§ 148), the quadratic striction leads instead to a small discontinuity of the 
entropy, ie. the phase transition is of the first kind and close to the second 
kind; the specific heat remains finite, although it has anomalously high 
values. 


PROBLEM 


Determine the correlation radius of the fluctuations of the order parameter in 
an external field h at T = T,. 


SOLUTION. The equilibrium value 7 is given by (144.9) and the thermodynamic 
potential density is . 
Q = Q)+ bn + e@n/er)?—hy 
3b1/3;,2/8 
= O+ 8 (n—-%)" +2(0n/or)*. 


For the correlation function we again obtain (146.11), with the correlation radius 
i= QWN6gU/2 /ZUA_UEAUS | 


§ 147. The effective Hamiltonian 


Before going on to describe the properties of the phase transition outside 
the range of applicability of the Landau theory (i.e. in the immediate neigh- 
bourhood of the transition point) we shall show how the statistical problem 
of investigating these properties could be formulated.° 

According to (35.3), the thermodynamic potential Q is given by the parti- 
tion function 


Q =—Tlog } ent fe-Fxeor dry, (147.1) 


where the integration is taken over the whole phase space of a system of N 
particles. If the integration is taken only over the part of phase space that 


t This case occurs, in particular, for transitions from the paramagnetic to the 
ferromagnetic state, where the order parameter is the magnetisation vector of the 
crystal. A linear dependence of the deformation on the magnetisation is excluded 
by the requirement of symmetry under time reversal (which leaves the deformation 
unchanged but changes the sign of the magnetic moment). 

t See A.I. Larkin and S. A. Pikin, Soviet Physics JETP 29, 891, 1969. 

§ This formulation of the problem for phase transitions of the second kind is 
due to L. D. Landau (1958). 
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corresponds to a given distribution of the order parameter 7(r), the function- 
al Q{[n(r)] defined by (147.1) may be regarded as the potential corresponding 
to that distribution. A continuous distribution 7(r) may here be conveniently 
replaced by a discrete set of complex variables n, = 1, -+in,, the components 
in the Fourier expansion (146.7). Then the definition of 2[7] becomes 


Q[n(x)] = —T log 3 euNiT | exp ( - Put? 2) 
| x] 8(m%— mp» 93 .N)) 8(m —m(p,93.N))dPy, (147.2) 


where 7,(p, 7; N) are the quantities 1”, as functions of the point Ps q in phase 
spaces It is evident that, with this definition, 


Q =—T log { exp (—Q[n]/T) II dn. dy . (147.3) 


In§ 146 it has been shown that only fluctuations with small wave vectors k 
are subject to an anomalous increase near the transition point; these fluctua- 
tions therefore govern the nature of the singularity of the thermodynamic 
functions. At the same time, such quantitative characteristics of the substance 
as the transition temperature T, itself are determined mainly by the short- 
range atomic interactions, which correspond to the short-wavelength compo- 
nents 7,. This physically obvious fact is represented in the integral over states 
by the correspondence between large values of k and large phase volumes. 

Let k, (the cut-off parameter) be a value of k small compared with the 
reciprocal of the characteristic atomic dimensions. The long-wavelength 
part of the distribution 7(r) is given by the sum 


Hr) = Os mes , (147.4) 


and the thermodynamic potential Q[7] corresponding to this distribution is 
given by (147.2), where the product over k is to be extended only to values 
k < Ko. Accordingly, the relation between 2[%] and 2 is given by (147.3) with 
integration only over fj, with k < k,.t 

Near the transition point, the functional Q[7] can be expanded in powers 
of the function 7(r); since this function is a slowly varying one, we need take 
in the expansion only the terms of lowest order in its derivatives. The expan- 
sion must also take into account the existence of the phase transition, since 


tTo simplify the treatment, we regard the physical quantity 7 as classical. 
This assumption is not important, since the long-wavelength variable 7 is certainly 
classical. For quantum systems, however, it is necessary to satisfy a condition of 
the form 7ik,u <« 7, where u is the characteristic rate of propagation of oscillations 
of the order parameter. 


32 
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the value of 7, is determined by the short-wavelength components excluded 
from 7. This means that the expansion of 2[7] must have the form (146.5) 
directly: 

QA] = Qot | (ath? + bH+e( VHP —hq] dV. 


Finally, omitting the tilde, we arrive at the following expression for the 
thermodynamic potential 2: 


Q—Qy = —T log f exp (— Hesr/T-) mel dm. dn , (147.5) 
<ko 


where 
Here = § [atn? + br! +2(7n)—hn] dV (147.6) 


acts as the effective Hamiltonian of the system undergoing a phase transition. 

In the range where the Landau theory is applicable, the fluctuations are 
small. This means that, in the integral over states (147.5), the important 
values of 7 are those lying in a narrow range near the value 7 that minimises 
the effective Hamiltonian. Using the saddle-point method (i.e. replacing 
the exponent by its expansion about the minimum), we ought to recover the 
thermodynamic potential as in the Landau theory; hence the coefficients in 
the effective Hamiltonian and in the Landau thermodynamic potential 
must be exactly the same. The fluctuation corrections, however, cause a 
certain shift of the transition temperature T, relative to the value T® which 
occurs in (147.6) through the difference t = T—-T. 

The integral (147.5) is taken over an infinity of variables 7, (when the 
effective Hamiltonian has been expressed in terms of these variables by 
substituting 4(r) from (147.4)). If this “continuum integral” could be calculat- 
ed, we should know the nature of the singularity of 2(u, T) near the tran- 
sition point. The calculation is impossible, however. 

In the creation of the singularity, the significant fluctuations are those 
with wave numbers k ~ 1/r,. When t > 0, the correlation radius r, + ©, 
so that very small values of k become important. It is therefore very prob- 
able that the nature of the singularity does not depend on the choice of the 
cut-off parameter ko. If this singularity is assumed to consist in the presence 
in the thermodynamic potential of terms containing fractional powers of 
the temperature ¢ and the field h, the above assertion implies that the indices 
of these powers (the critical indices) are independent of Ko. 

Hence it must in turn follow that these indices are independent of the 
specific values of the coefficients b and g in the effective Hamiltonian, and 
therefore of « or P, on which b and g depend. For a change ky — ko/a is 
equivalent to a change in the coordinate scale (r — Ar), and such a change 
therefore cannot affect the critical indices. On the other hand, the transforma- 
tion r > Ar changes the coefficient g in the effective Hamiltonian, but not 5; 
the critical indices must thus be independent of g. Similarly, by making the 
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simultaneous transformations r > Ar, and 4 ~ An for the continuum integra- 
tion variable, we change b but not g, and the critical indices are therefore also 
independent of b. A change in the coefficient « is unimportant, since it is 
cancelled by a corresponding change in the scale of ¢, which cannot affect 
the exponent. 

Thus we should expect that the critical indices are the same for all systems 
with an effective Hamiltonian of the form (147.6). They may differ, however, 
if the symmetry of the system is such that (with one order parameter, as 
before) the term quadratic in the derivatives in the effective Hamiltionian 
has the more general form (146.4). 

Continuing this line of argument, we may expect that in more general 
cases also, when the change of symmetry in the transition is described by 
more than one order parameter, the critical indices depend only on the struc- 
ture of the effective Hamiltonian, not on the specific values of the coefficients 
in it. Here the “structure” of the Hamiltonian includes the number and 
form of the fourth-order invariants (and the signs and inequalities relating 
to their coefficients) and the form of the terms quadratic in the derivatives 
of the order parameters. The questions then arising, however, have as yet 
hardly been studied at all. 

Lastly, we may add a few remarks on the calculation of the successive 
terms in the expansion of the partition function (147.5), (147.6) in powers of 
b. Leth = Oandt > 0, so that = 0; when b = 0, the effective Hamiltonian 


Is 
He = VY (at+gk)| mel? (147.7) 
k<ky 


it separates into a sum of terms each depending only on one of the y,. The 
integral over states is easily calculated (see Problem). Further terms in the 
expansion (corresponding to the inclusion of the “interaction” between 
fluctuations with different k) are products of different 7, averaged over the 
Gaussian distribution [oc exp (— H{}/T.)]. For such integrals a theorem is 
valid whereby the mean value of the product of several 1, is equal to the sum 
of the products of pairs of mean values of factors chosen in all possible 
ways from those concerned. Each such mean value is a correlation function 
of fluctuations (in the k representation), and hence the calculation of the 
successive terms in the expansion in powers of b amounts to that of various 
integrals of products of correlation functions.t As the transition point is 


+ The theorem stated plays a role here analogous to that of Wick’s theorem in 
quantum electrodynamics, and the separate terms of the series may be transformed 
by graphs analogous to Feynman diagrams. An account of the resulting “diagram 
technique” for calculating the partition function is given by A. Z. Patashinskil 
and V. L. Pokrovskil, Fluctuation Theory of Phase Transitions, Pergamon Press, 
Oxford, 1979. 
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approached, these integrals diverge, but it is not possible to distinguish 
among them any set of “most strongly” divergent ones over which a summa- 
tion could be carried out.t 

In the above formulation of the problem, it is assumed that the nature 
of the singularity is independent of any higher-order terms in the expansion 
of the effective Hamiltonian in powers of 7. There are good reasons for sup- 
posing that this is in fact so, since such terms lead to integrals that diverge 
less strongly than those resulting from the term ~ 7. 


PROBLEM 


Find the first fluctuation correction to the specific heat in the range where the 
Landau theory is valid (A. P. Levanyuk, 1963). 


SOLuTION. The calculation will be given for the symmetrical phase in the absence 
of a field. In the first approximation, the effective Hamiltonian is (147.7). A cal- 
culation of the integral over states from (147.5) gives 


T 
Q-2,=—-T Aiea ei 
” oy, © Wattek) 


Ko 
Watt gk®)  2k® dk | 
= rw | log T° aa 


the integration is over half of k-space, since 7, and 7_, are not independent. 
Being the small correction in the potential Q, this expression also gives the correc- 
tion to the potential ®. A twofold differentiation of this expression with respect 
to ¢ gives the correction to the specific heat: 


_ T2Ve2 ( Kdk _ T2Ve3® 1 


Cee = aa | aabele Teg V7 


If this correction is to be small compared with the specific heat discontinuity 
(143.8), we again obtain the condition (146.15) for the Landau theory to be ap- 
plicable, in the form 


a [t] >> 72b?/32a%¢5. (2) 


The large numerical factor in the denominator on the right-hand side should 
be noted. 


+ Such a distinguishing is possible in the formal problem of a phase transition 
in four-dimensional space (in which case the integrals diverge logarithmically as 
t + 0). This is the basis of a method proposed by K. G. Wilson (1971) for estimat- 
ing the critical indices, which are calculated for the case of a space of 4—e dimen- 
sions (where € is small), the result then being extrapolated to e = 1. 
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- § 148. Critical indices 


The existing theory of phase transitions of the second kind is based on certain 
hypotheses that are entirely plausible though not rigorously proved. It rests 
also, of course, on the confirmation of these hypotheses by experimental 
results and by numerical calculations for certain simple models. 

These results suggest that, as T~- T,, the derivative 0C,/OT always be- 
comes infinite, and so in many cases does the specific heat C, itself. From this 
we can already draw various conclusions about the behaviour of certain 
other thermodynamic quantities. This will be done here on the assumption 
that the specific heat itself becomes infinite (A. B. Pippard, 1956). 

The tending of C, = T(0S/0T)> to infinity means that the entropy of the 
body can be written as 


S = S(T, P—PAT)), 


where P = PT) is the equation of the curve of the points of the phase 
transition in the PT-plane; the derivative of S with respect to its second 
argument becomes infinite as P—P, + 0. Denoting differentiation with 
respect to that argument by a prime and retaining only the divergent terms, 
we have 
(0S/0T)p = —S’ dP,/aT, 
—(0V /OT)p = (0S/OP)r = S’, 
whence 
dP. (OV 
c= 1. Gt (Gr), as T+Th, (148.1) 

ie. the thermal expansion coefficient tends to infinity in the same way as C,. 

It is easily seen that this derivation amounts to equating to zero the diver- 
gent part of the derivative of S along the curve of transition points. It is 
therefore natural that (148.1) has the same form as_ the equation (143.10) 
obtained by differentiating 4S = 0 along the same curve, and differs from 
it only in the absence of the symbol 4. We can hence write down at once 
another relation by analogy with (143.9): 


4 OV\ dT. Cy (ATe\? 
| - (ap), = (ar) “ap = 7 ( aE) , (148.2) 


ie. the isothermal compressibility also becomes infinite (but the adiabatic 
compressibility remains finite, by (16.14)). The specific heat C, remains 
finite; from (143.14) we see that it is not even discontinuous at the transition 
point: since the right-hand side of (143.14) is zero (because (0V/@P), is 


484 Phase Transitions of the Second Kind and Critical Phenomena 


infinite), AC, = 0.' The same is true of the derivative (@P/8T), ; substitution 
of (148.2) in (16.10) shows that on the transition line 


(OP/OT)y = dP,/aT. | (148.3) 


It should be emphasised that the above results essentially depend on the 
occupation, by the phase transition points of the second kind, of an entire 
line in the PT-plane, the slope of this line being finite. 

Let the temperature dependence of the specific heat in the fluctuation 


range be written 
C0 |p=*, (148.4) 


where again t = T—T,. We shall see later in this section that there is reason 
to suppose the values of the exponent « equal on either side of the transition 
point (and the same is true of the other exponents introduced below). The 
proportionality factors in (148.4) are, of course, different on the two sides. 
Since the quantity of heat { C, dT must always be finite, it follows that a < 1. 
If only 6C,/oT, and not the specific heat itself, tends to infinity, then —1 < 
a < 0; the expression (148.4) in that case determines only the singular part 
of the specific heat: C, = C,o+C,, |t|~*. 

The way in which the equilibrium value of the order parameter tends to 
zero in the unsymmetrical phase may be written as 


no (—-1f, B>O. (148.5) 


By definition, the exponent f refers only to the unsymmetrical phase. We shall 
assume for definiteness here and below that temperatures f < 0 correspond 
to the unsymmetrical phase. 

To describe the properties of the fluctuations themselves of the parameter 
4, an exponent » is used which defines the temperature dependence of the 


correlation radius: 
reo jtl-", v>O0, (148.6) 


and an exponent ¢ which gives the decrease of the correlation function with 
increasing distance at ¢ = 0: 


G(r) a r~-24+6), (148.7) 


where d is the dimension of space (== 3 for ordinary bodies). The expression 
(148.7) is written in this form in order to have a definition suitable also for 
phase transitions of the second kind in two-dimensional systems (d = 2). 


t The impossibility of infinite C, on the transition line is also evident from the 
fact that it would lead to C, = T(dV,/dT)? (OP/OV)z (cf. (143.14)), which is certainly 
impossible, as C, is positive and (OP/OV)r is negative. The specific heat C,, has, 
however, an infinite derivative on the transition line (see Problem). 
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The law (148.7) is valid also for non-zero |¢| « 7,, but only for distances 
rr. 

The exponents in (148.4)—(148.7) are called critical indices or critical 
exponents. It must be emphasised that the degree of accuracy of the subsequent 
derivation of relations between the critical indices does not allow the 
distinguishing of logarithmic factors superimposed on power laws. In this 
sense, for example, a zero exponent may mean either that the quantity tends 
to a constant limit or that it increases logarithmically. 

A further group of indices are used to describe the properties of a body 
in the fluctuation range when an external field h is present. Here we must 
distinguish the ranges of fields that are “weak” and “strong” in the sense 
described at the end of § 144:h<h, or h >h,, where h, is the field value 
for which the field-induced parameter 7,5 ~ yh becomes of the same order 
as the characteristic value of the spontaneous-order parameter 7,,(#). The 
region of weak fields has an exponent y giving the variation of the suscep- 
tibility: 

yo |t|-”, yp>O. (148.8) 
The indices defined above may also be assigned to this region: the laws 
(148.4)-(148.6) derived for zero field relate, of course, also to the limiting 
case of weak fields. 

For the opposite case of strong fields, we define critical indices which give 
the field dependence of the thermodynamic quantities and the correlation 
radius: 


Cc h=*, (148.9) 
noc hve (6 > 0), (148.10) 
re oc h-# (u > 0); (148.11) 


we take for definiteness the case h > 0.t 

The universality of the limiting laws for the behaviour of a substance in the 
fluctuation range near a phase transition point of the second kind, in the 
sense discussed in § 147, implies a similar universality of the critical indices. 
Thus we must expect their values to be the same for all transitions with a 
change of symmetry described by only one order parameter. 

The critical indices are connected by various exact relations. Some of 
these follow almost immediately from the definitions of the indices, and we 
shall first derive these. 

It has been shown in§ 144 that the application of an external field A smooths 
out the phase transition over a certain temperature range. The magnitude 


t The Landau theory corresponds to the following values of the critical indices: 


a = 0, p= i, y= 1, 6= 3, e= 0, p=, v= 3, C= 
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of this range of ¢ can be estimated from the condition 7,,4(4) ~ 1,,(#) men- 
tioned above, if this is now taken as a condition on # for given h. According 
to the definitions (148.5) and (148.8), we have 


Nop CEP, Ning = YRoch|t|-”, 


and equating the two gives 
[r|P+y oc h. (148.12) 


On the other hand, the same smoothing range can be estimated from the 
requirement that the field part of the thermodynamic potential, —Vmh, is of 
the same order of magnitude as the thermal part, which is ~ PC, since 
C, =—T0G/eT*. Hence we find |t|~?-*~* oc A, and, expressing A in terms 
of ¢ from (148.12), obtain the equation 


(J. W. Essam and M. E. Fisher, 1963). 

We next use the obvious fact that, at the edge of the region of smoothing 
of the transition (i.e. with the condition (148.12)), we can equally well express 
each thermodynamic quantity in terms of the temperature ¢ or the field h. 
Hence we find, for example, 


m oc |t/F oc AM, 
and, expressing A in terms of ¢ from (148.12), obtain the equation 
fd = Bty (148.14) 


(B. Widom, 1964). By a similar method starting from the two limiting forms 
of the specific heat C,, we find 


e(B+y) = a. (148.15) 


The equations (148.14) and (148.15) relate the indices that determine the 
temperature dependence of the thermodynamic quantities in weak fields and 
their dependence on A in strong fields. 

An analogous equation is found by the same method for the indices 
giving the behaviour of the correlation radius: 


u(B+y) = ». (148.16) 


Lastly, one further relation can be derived by an estimate of the expressions 
on either side of (146.13). According to (146.2) and the definition (148.8), 
the mean square fluctuation in a given volume V is 


(Any = Tez/V oc |t|-”. 
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The integral of the correlation function is determined by the region of space 
~ rin which this function is considerably different from zero and, according 
to the definition (148.7), its order of magnitude is oc r>@+?+, Hence the 
magnitude of the integral is (in d-dimensional space) 


oc poopy 4-248) = p28 o¢ | f[-72-9, 
A comparison of the two expressions gives 
wW(2—C) = y. (148.17) 


Thus we have five relations between the eight indices, and so they can all 
be expressed in terms of only three independent ones. 

From this we can, in particular, draw the conclusion already mentioned, 
that the values of the “temperature” indices «, y, v are the same on both 
sides of the transition point: for, if y, say, were different for t > 0 and 
t < 0, it would follow from (148.14) that 6 also depends on the sign of t. 
This index, however, relates to strong fields h, which satisfy only the condi- 
tion h >> h, independent of the sign of t, and it too therefore cannot depend 
on this sign; the same is true of the other two “field” indices, « and u. From 
the relations (148.13) and (148.16), we then find that « and » also are inde- 
pendent of the sign of f. 

The results obtained enable us to draw some conclusions about the ther- 
modynamic functions of the system for any relationship between ¢ and h. 
This will be demonstrated for the function 7(¢, h), which may be written in 
the form 


t 
q = aa ‘ 


(for given P). The choice of the first argument of fis determined by the condi- 
tion (148.12), which separates the cases of weak and strong fields (and we 
have put B-+y = Bé in accordance with (148.14)); this argument takes all 
values from small to large. The argument ¢ is always small near the transi- 
tion point, and must be taken as zero in obtaining the principal term in the 
function 7(¢, h). Thus we arrive at the expression 


n(t, h) = hf (t/hU), h>0, (148.18) 


where fis a function of the one argument x = t/h”"*, The expression (148.18) 
is written for h > 0; owing to the symmetry of the system under a simulta- 
neous change of sign of h and n, the formula for h < 0 is obtained from 
(148.18) by simply substituting —h for h and —7 for n. 
In strong fields (x « 1), the limiting form (148.10) must be obtained; 
this means that 
f(x) = constant for x-— 0. (148.19) 
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Moreover, when A ~ 0 the order parameter is non-zero both for t > 0 and 
for t < 0, and ¢ = Ois not physically distinctive; this means that the function 
f(x) has an expansion in integral powers of x. 

In weak fields when ¢ < 0, the order parameter follows the law (148.5), 
and for t > 0 we must have n = yh with x given by (148.8); from these 
requirements it follows that 


f(x) oc (—xfP as x+—m, f(x) x7” as x +o. (148.20) 


The concept of a weak field presupposes that t ~ 0. For a given non-zero 
value of ft, a zero field is not a singularity of the thermodynamic functions. 
Hence the function 7(t, h) for t + 0 can be expanded in integral powers of 
the variable h (and the expansion is different for t > 0 and t < 0). A natural 
formulation of this property would, however, require 7(f, h) to be written 
not in the form (148.18) but in terms of a function of the variable h/t’. 

Similar considerations can be applied to the correlation function of fluc- 
tuations of the order parameter. For instance, in the absence of a field it 
depends on the parameter f as well as on the distance r. Near the transition 
point, however, the correlation function G(r; t) can be written as 


1 


i.e. in terms of a function of the one variable x = rr’. As x ~ 0, this function 
tends to a constant limit, in accordance with the definition (148.7), and as 
x -— 0 it decays exponentially, the correlation radius as a function of tem- 
perature being given by (148.6). 


PROBLEM 


Find the law of temperature dependence as t + 0 for the derivative 0C,/O7 if 
C, tends to infinity as (148.4) with « > 0. 


SOLUTION. With greater accuracy than in (148.1), (148.2), we write for t -> 0 


p= Te ap (ar) +4 
3 aVv\ dP, b dT, 
<a =- — f —+— se 
oT], OP}/,,dT 'T, dP’ 


with a and b constants. Substituting these expressions in (16.9), we find 
C, = a—b—B*I/C,. 


If C, increases as |t|~*, the OC, /OT oc |¢|~2-”. At t = 0 the function C,(s) has 
a maximum at a cusp with a vertical tangent. 
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§ 149. Scale invariance 


The relations (148.13)-(148.17) do not involve any assumptions about the 
nature of the fluctuation pattern near the transition point. Further conclu- 
sions about the critical indices require specific assumptions in this respect. 

We may note that the theory in general involves two characteristic dimen- 
sions which determine the spatial distribution of the fluctuations: the corre- 
lation radius r, and the dimension ro of the part of the body in which the 
mean square fluctuation of the order parameter is comparable with its char- 
acteristic equilibrium value.t The inequality (146.14) which ensures the 
applicability of the Landau theory may be written r, >> 7 (for, according to 
(146.13) and (146.11), we have, in a volume V ~ 73, ((An)’) ~ T,/gry and, 
equating this to 7? ~ «|t{/b, we find ro ~ T,b/ga|t|; comparison with 
r, (146.12) gives the condition (146.15)). As ¢ > 0, 79 increases more rapidly 
than r,, and they become comparable at the boundary of the Landau region. 
The chief assumption about the fluctuation region (this being defined by the 
opposite inequality to (146.15)) is that the theory here contains no small 
parameter. In particular, we must everywhere have rp ~ r,, so that r, is the 
only dimension characterising the fluctuations. This is called the hypothesis 
of scale invariance (L. P. Kadanoff, 1966; A. Z. Patashinskif and V. L. 
Pokrovskii, 1966). 

To estimate the fluctuations in the volume V ~ 7°, we can use formula 
(146.2).8 Substituting in the condition 


Te4|V ~ 1? (149.1) 


the volume V ~ r4 and then expressing 4, r, and 7 as powers of t according 
to the definitions of the critical indices, we obtain »d—y = 28 or, using 
(148.13), 


vd = 2a. (149.2) 


By combining this relation with those derived in§ 148, we can express all the 
critical indices in terms of only two independent ones.!! 


t It is therefore not surprising that all these relations are satisfied in the 
Landau theory also. 

+ This discussion refers, of course, only to the distribution at distances large 
compared with atomic dimensions. 

8 In this form (i.e. expressed in terms of the susceptibility y) this formula is 
general and does not depend on the assumptions made in the Landau theory, as 
mentioned after (146.2). 

ll In the Landau theory there is no scale invariance, and equation (149.2) is 
therefore invalid. 
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The requirement of scale invariance enables us to derive in a uniform 
way all the relations between the critical indices. To do so, we shall first give 
a more formal statement of this requirement. 

Let the scale of all spatial distances change by the same factor: r ~ r/u, 
where u is some constant. Then the scale invariance consists in the assertion 
that the scales of measurement of ¢, h and 7 may be changed in such a way 
that all the relations in the theory remain unaltered. In other words, we can 
choose the exponents 4,, 4,, 4, (called the scaling dimensions) in the trans- 
formations 


t—- tu4, h—~— hud, n—> nu4n when r—r/u (149.3) 


in such a way that the factor u does not appear in any of the relations. 

The change in the spatial scale must, in particular, bring about a similar 
change in the correlation radius of the fluctuations (7, ~ r,/u); this ensures 
that the asymptotic form of the correlation function ~ exp(—r/r,) is in- 
variant. According to the definitions (148.6) and (148.11), with A = 0 the 
correlation radius r, = constantX1~’, and with ¢ = 0, r, = constantXxh™. 
Applying the transformation (149.3) and the condition that the coefficients 
in these expressions should remain unchanged, we obtain 


A,=1/rv, dy = Iu. (149.4) 


Let us next consider the change in the thermodynamic potential due to an 
infinitesimal change in the field h. According to (144.2), 


dd = —Vndh 


(with ¢ = constant and, as always, P = constant). In the scale transfor- 
mation, the volume V + V/1; with the condition that d® is unchanged, 
i.e. that 
Vu-4 .nu4n-dhuds = Vy dh, 
we find 
A, = d—A, = d—1/u. (149.5) 


Thus the dimensions 4,, 4, and 4, are expressed in terms of the two critical 
indices 4 and v. The requirement of scale invariance of the other relations 
leads to the expression of the other critical indices in terms of these two. 

Let us specify the condition for invariance of the “equation of state” 
of the system, i.e. the expression of the order parameter in terms of the 
temperature and the field, 7 = 7(t, h). This means that we must have 


y(tud, hudrs) = udnn(t, h). 


The solution of this functional equation has the form 
(t, h) = hédl4nf(t/hads) 
= Wa-3f(t/her), (149.6) 
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Similar arguments can be applied to the thermodynamic potential O(¢, h), 
or more precisely to its singular part, which we shall use ® below to denote. 
Being additive, the total thermodynamic potential of the body is proportional 
to its volume. Hence the requirement of its invariance under the scale trans- 
formation may be written 


1 1 : 
7 P(tu4, huts) = — Blt", hut) = G(s, hh), 


from which 
D(t, h) = htep(t he’, (149.7) 


The functions f and @¢ in (149.6) and (149.7) are of course related, since 
—00/dh = nV. These expressions have been written forh > 0; in view of the 
symmetry of the effective Hamiltonian under the changes h + —h, 1 + —y, 
the formulae for h < 0 are obtained from those given above by making these 
same changes.' 

Similar arguments can be based on (149.7). As already mentioned in 
connection with (148.18), for a given non-zero h the thermodynamic func- 
tions have no singularity with respect to ¢ and must therefore have expan- 
Sions in integral powers of this variable. This means that, when h + 0 and 
t + 0, the function $(x) in (149.7) is expanded in integral powers of the small 
variable x = t/h“’”. The first terms in this expansion are 


t e 
P(t, h) oc wl bea ae +8 Fae + |, (149.8) 


where c, and cs are constant coefficients. Now, applying the condition that 
the order parameter and the specific heat, calculated as 


1 0@ Fo 2) 
1= 7 RH? @—Teae 


should have the forms 7 och” and C, oc h-* as t - 0 (these correspond to 
the strong-field case), we get two relations between the critical indices: 


(ud—1)6 = 1, u(—2) og 


v 


t It may be mentioned once more, however, that in the effective Hamiltonian 7 
appears as a variable over which the continuum integration is taken in the integral 
over states. In the thermodynamic formulae, 7 is the equilibrium value of the order 
parameter, which is given by the derivative 0@®/Oh (or 022/0h) of the thermody- 
namic potential determined from the integral over states. The symmetry of the 
effective Hamiltonian leads, of course, to an analogous symmetry in the thermo- 
dynamic relations. 


492 = Phase Transitions of the Second Kind and Critical Phenomena 


it is easily verified that they in fact follow from the relations derived pre- 
viously by a different method. 

Next, let ¢ be non-zero; then the thermodynamic quantities have no sin- 
gularity when h passes through zero, and so the function P(t, h) can be 
expanded in integral powers of h. This means that for h + 0 and ¢ # 0 
the expansion of the function ¢(x) in powers of the small variable 1/x = h“?/t 
must have the form 


f(x) oc x41 + cyx—-/# 4+ cox 2H +. J; 


the factor x compensates the non-integral power h*“, and the expansion 
variable x~”“ oc h. The expansion is, however, different for ¢ > 0 and for 
t < 0. When ¢ > 0, the potential D(¢, h) contains only even powers of h, 
since the derivative —d®/dh = Vn must be (in the symmetrical phase) an 
odd function of h: 


2 
Doc él 1tes +o, t>0, h-0. (149.9) 


perl 


Ash - 0, the specific heat must behave as ¢~*, and the order parameter 
must be 1 = yh oc ht—” (corresponding to the weak-field case); it is easily 
seen that the resulting relations are again equivalent to those already known. 
If the temperature ¢ < 0, then the expansion of O(t, h) as h + 0 contains all 
integral powers of h: 


h h? 
® x (—| Leer ae tee pe |, t< 0, h-0O 


(149.10) 


(with, of course, different coefficients c; and cg). It is easily verified that the 
required form (—1)° is obtained for the spontaneous (h-independent) order 
parameter. 

The transformation of the correlation radius has been discussed above. 
We have still to consider the correlation function of the fluctuations of the 
parameter 7 for ¢ + 0 and apply the conditions for scale invariance of the 


expression 
G(r) = constant X r~@-2+0) (t = 0). 


Here it must be assumed that the fluctuating quantities 7(r) at different 
points in space are transformed independently in the same way as the mean 


t If (149.10) relates, say, to fields A > 0, the formula for 4 < 0 is obtained from 
it by the substitution h + —h. It may be recalled (see § 144) that for ¢ < 0 the 
states in fields of opposite sign refer to physically identical “phases” differing 
in the sign of the order parameter (both spontaneous and field-induced); as h ~ 0, 
the two phases are in equilibrium with each other. 
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value 7. It is important that the distances r concerned are small compared 
with the correlation radius, but large compared with interatomic distances. 
Then the correlation function is transformed by G ~ Gu?“", and we have the 
condition 


d+2—= = 6. (149.11) 


This also is a consequence of the relations already known. 

Let us finally consider the numerical values of the critical indices. The 
experimental results and numerical calculations indicate that (in three- 
dimensional space) the indices « and ¢ are quite small: « ~ 0.1, € ~ 0.05. 
The first line of the following table gives the values of the other indices 
found by putting « = € = 0 (d = 3). The second line gives the values ob- 
tained by taking for « and € the values estimated by Wilson’s method (§ 147) 
for transitions described by an effective Hamiltonian (147.6) with a single 
order parameter: 

o B y ) E Ub v C 
+ + 5 0 4 #2 0 (149.12) 
0.08 0.33 1.26 4.8 0.05 0.40 0.64 0.04 


§ 150. Isolated and critical points of continuous transition 


The curve of phase transitions of the second kind in the PT-plane separates 
phases of different symmetry, and cannot, of course, simply terminate at 
some point, but it may pass into a curve of phase transitions of the first kind. 
A point at which this happens may be called a critical point of a transition of 
the second kind; it is in some ways analogous to an ordinary critical point. 
See the point K in Fig. 66; in this and subsequent diagrams, continuous and 
broken lines represent curves of phase transition points of the first and second 
kinds respectively.* 


Pp 

t ye 
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| / 
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t The term sricritical point is also used in the literature for a point such as K. 
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In the Landau theory, the properties of a substance near such a point may 
be investigated by the method (§ 143) of expansion in powers of the order 
parameter (L. D. Landau, 1935). 

In the expansion (143.3) the critical point is given by the vanishing of the 
two coefficients A(P, 7’) and B(P, T); if A = 0 but B > 0 we have a transition 
of the second kind, and so the curve of such transitions terminates only where 
B changes sign. For the state of the body at the critical point itself to be 
stable, it is necessary that the fifth-order term should be identically zero and 
the sixth-order term should be positive. Thus we start from the expansion 


O(P,T,) = Po P,T)+ AP, Ty? + BP, Ty + DP, T)n®, (150.1) 


with 4, = 0, B,, = 0, D,, > 0 at the critical point. 
In the unsymmetrical phase, the minimisation of the thermodynamic po- 
tential gives 


jie -_ [—B-+4/(B°—3AD)]. (150.2) 


For the entropy S = —0@/dT of this phase we have, omitting terms of 
higher order in n, S = So—an?, where a = 0A4/OT. A further differentiation 


gives the specific heat 
Ta’ 


i 24/(B?—3AD)’ 


(150.3) 
where only the term whose denominator vanishes at the critical point is 
shown. 

Let Typ = To(P) be the temperature for which B?—3AD = 0; it is evident 
that 7, = T,, when P = P,,. The first term in the expansion of B?—3AD in 
powers of T—Tp is 


B*—3AD = —3aoDo(T—T)). (150.4) 


Near the critical point, the difference T,(P)—T7)(P) is a second-order small 
quantity: since we have A = 0 when T = TP), the difference 


T(P)—To(P) = —B?/3aDo, (150.5) 


and tends to zero as B® when P - P,,. 
Substitution of (150.4) in (150.3) gives 


T\2 4 
C= (ap). RT (150.6) 


to the same accuracy, the coefficient in this formula may be taken at T,, 
instead of T>. Thus the specific heat of the unsymmetrical phase increases as 
1/-/(To—T) as the critical point is approached. 
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For states actually on the curve of transitions of the second kind, we find, 
substituting A = 0 in (150.3) (or (150.5) in (150.6), 


CH = Tu02,/2B. (150.7) 


The quantity B is zero at the critical point and is proportional to T—T,, 
(or P—P,,) near that point. 

Let us now determine the specific heat of the unsymmetrical phase, on a 
line of transitions of the first kind but again near the critical point. At points 
on this line, two different phases (symmetrical and unsymmetrical) are in 
equilibrium with each other. The value of the parameter 7 in the unsymmet- 
tical phase is determined by the condition of equilibrium ®(y) = ®o, and at 
the same time we must have 60/67 = 0. Substitution of ® in (150.1) gives 
the equations 

A+Br?+Dny = 0, A+2Br?+3Dn! = 0, 
whence 
1? = —B/2D, (150.8) 


and substitution of this value in the equation ®(7) = Do then gives 
4AD = B?. (150.9) 


This is the equation of the line of transitions of the first kind. 
The specific heat of the unsymmetrical phase on this line is found by simply 
substituting (150.9) in (150.3): 


CD = T2az/| Bl. (150.10) 


Comparison with (150.7) shows that the specific heat on the line of transi- 
tions of the first kind is twice as great as on the line of transitions of the 
second kind at the same distance from the critical point. The heat of tran- 
sition from the unsymmetrical to the symmetrical phase is 


q= Te(So—S) 
= (aT/2D)-r | BI. (150.11) 


We can also show that the curve of transitions of the first kind passes 
smoothly into the curve of transitions of the second kind at the critical point. 
On the first curve, the derivative d7/dP is determined by the condition 


2D dA+2A dD—B dB = 0, 


obtained by differentiating (150.9). The equation of the curve of transitions 
of the second kind is A = 0, so that d7/dP is determined by the condition 
dA = 0. At the critical point, 4 = 0 and B = 0, so that the two conditions 
coincide and d7/dP has no discontinuity. Similarly, it can be shown that the 
second derivative d?7/dP? does have a discontinuity. 
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As the critical point is approached along the line P = P,, the specific 
heat C, varies as | t]~*” (cf. (150.6)), i.e. the index a = 4. The order parameter 
in the unsymmetrical phase varies according to 7 = (—A/3D)"* oc [t|"4, 
i.e. the index 6 = 4. The index » which determines the behaviour of the corre- 
lation radius then has the value 4, as at any transition point of the second 
kind in the Landau theory; the vanishing of B does not affect the result in 
the approximation used to derive (146.8). The values of the remaining indices 
are derived from (148.13)-(148.17) asy = 1,6 =S5,e= w= 2,6=0. 

We know already that the Landau theory, on which the results given here 
are based, is inapplicable near a line of transitions of the second kind. It is 
noteworthy, however, that the conditions for this theory to be valid are more 
easily satisfied as the critical point is approached, as is seen from the inequal- 
ity (146.15), where B appears on the right. Of course, the vanishing of B 
does not mean that there are no fluctuation corrections at the critical point, 
but the index values given above are in accordance with the scale invariance 
relation (149.2). It is therefore natural that the fluctuation theory leads only 
to further logarithmic corrections (which, it may be recalled, are not shown 
by the values of the indices). 

Let us next consider (again in terms of the Landau theory) some proper- 
ties of points of intersection of lines of phase transition of the first and second 
kinds. 

The symmetry of the unsymmetrical phase in a phase transition of the 
second kind is determined (as shown in § 145) by the minimisation of the 
fourth-order terms in the expansion of ® as functions of the coefficients 
¥; = 7,/n. These terms also depend on P and T, however, and it may happen 
that the unsymmetrical phase has different symmetries on different parts of 
the line of transitions. In the simplest such case, we have an intersection 
of a line of transitions of the second kind (AC in Fig. 67) with one of tran- 
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sitions of the first kind (BD). The region I is the symmetrical phase; the sym- 
metry groups of phases II and III are sub-groups of that of phase I. They are 
not, however, in general sub-groups of each other, and therefore the curve 
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BD separating these phases is a line of transitions of the first kind. At B, all 
three phases are identical. 

Figure 68 shows a possible type of intersection of several lines of tran- 
sitions of the second kind. If I is the most symmetrical phase, the symmetry 
groups of phases II and III are sub-groups of that of phase I, and that of 
phase IV is a sub-group of both those of phases II and III? 


Fic. 68 


Finally, it remains to consider the case where the third-order terms in the 
expansion of the thermodynamic potential do not vanish identically. In this 
case the condition for the existence of a point of continuous phase transition 
requires that the coefficients C,(P,7T) of the third-order invariants in the 
expansion (145.6) should vanish, as well as A(P, 7). It is evident that this is 
possible only if there is not more than one third-order invariant, since other- 
wise we should obtain more than two equations for the two unknowns P 
and T. When there is only one third-order invariant, the two equations 
A(P, T) = 0, C(P, T) = 0 determine pairs of values of P and T, i.e. there are 
isolated points of continuous phase transition. . 

Since these points are isolated, they must lie in a certain way at the inter- 
section of curves (in the PT-plane) of phase transitions of the first kind. Since 
such isolated points of continuous transition have not yet been observed 
experimentally, we shall not pause to give a detailed discussion here, but 
simply mention the results.* 

The simplest type is that shown in Fig. 69a. Phase I has the higher sym- 
metry, and phases II and III the same lower symmetry, these two phases 


t The fluctuation corrections may probably cause a singularity at B, the curves 
AB and CB forming a cusp. 

tA point of intersection of the type shown in Fig. 67 is called bicritical in the 
literature, and one as in Fig. 68 tetracritical. 

§ See L. Landau, Zhurnal éksperimental’nol i teoreticheskol fiztki 7, 19, 1937; 
translation in Collected Papers of L. D. Landau, p. 193, Pergamon, Oxford, 1965. 
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differing only in. the sign of 7. At the point of continuous transition (O in 
Fig. 69) all three phases become identical. ee 

In more.complex cases two or more curves of phase transition of the first 
kind (e.g. two in Fig. 69b) touch at the point of continuous transition. Phase I 
has the highest. symmetry, phases II and III a lower symmetry, phases IV 
and V another lower symmetry, these pairs of phases differing only in the 
sign of 7. 


Fic. 69 


§ 151. Phase transitions of the second kind ina two-dimensional lattice 


The impossibility of a general theoretical determination of the critical 
indices lends especial interest to a simple model which allows an exact 
analytical solution of the problem of phase transitions of the second kind. 
This is a two-dimensional lattice model, for which the phase-transition 
problem was first solved by L. Onsager (1944)." 

' The model considered is a plane square lattice having N points, at each of 
which is a “dipole” with its axis perpendicular to the lattice plane. The dipole 
can have two opposite orientations, so that the total number of possible 
configurations of the dipoles in the lattice is 2.1 To describe the various 
configurations we proceed as follows. To each lattice point (with integral 
coordinates k, /) we assign a variable o,,; which takes two values +1, corre- 
sponding to the two possible orientations of the dipole. If we take into ac- 
count only the interaction between adjoining dipoles, the energy of the 
configuration may be written 


L 
E(o) =—-J no ; (OKI0K, 14.1 + OKIFK+1, 1)» (151.1) 


t The original method used by Onsager was extremely complex. Later, various 
authors simplified the solution. The method described below (which in part makes 
use of certain ideas in the method of M. Kacand J. C. Ward (1952)) is due to N. V. 
Vdovichenko (1964). 


t This model is known as the Ising model; it was in fact first used by W. Lenz 


(1920) and was studied by E. Ising (1925) for the one-dimensional case (in which 
there is no phase transition). 
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where L is the number of points in a lattice line, the lattice being regarded as 
a large square, and N = L?.t The parameter J (> 0) determines the energy of 
interaction of a pair of adjoining dipoles, which is —J and +-J for like and 
unlike orientations of, the two dipoles respectively. Then the configuration 
with the least energy is.the “completely polarised” (ordered) configuration, 
in which all the dipoles are oriented in the same direction. This configuration 
is reached at absolute zero; as the temperature increases, the degree of 
ordering decreases, becoming zero at the transition point, when the two 
orientations of each dipole become equally probable. 

The determination of the thermodynamic quantities requires the calcula- 
tion of the partition function 


Z a Decne = fceP {° pi, (oH, 141 tones], (151.2) 


taken over all the. 2" possible configurations (6 = J/T). The equation 
exp (Oo,107) = cosh 6+0;;0." sinh 6 = cosh (1 + oxo, tanh 8) 
is easily verified by expanding both sides in powers of 6 and using the fact 
that all the o?, = 1. The expression (151.2) can therefore be written 
Z = (1—x*)-"'S, (151.3) 
where 


L 
S= d J1,¢ + XOKICK, 141) (Ll + XOK0K41, 1) (151.4) 


andx=tanh#. ss. 

The summand in (151.4) is a polynomial in the variables x and o,;. Since 
each point (k, /) has four. neighbours, each o,, can appear in the polynomial 
in powers from zero to four. After summation over all the o,,=-+1 the 
terms containing odd. powers of o,,; vanish, and so a non-zero contribution 
comes only from terms containing o,, in powers 0, 2 or 4. Since of, = of) = 
of, = 1, each term of the polynomial which contains all the variables o,, 
in even powers gives a contribution to the sum which is proportional to the 
total number of configurations, 2”. 

Each term of the polynomial can be uniquely correlated with a set of lines 
or “bonds” joining various pairs of adjoining lattice points. For example, the 
diagrams shown in Fig. 70 correspond to the terms 


(a) XO, 09 41,1541, 1-15 

8 3 
(b) x82 07 5 1,198 +1,1-19, 1-19, 1-20 —1,1-19F-1,1-2» 

10 2 2 2 
(c) x Ok + 1,194 1,1-19R, 1-192, 1-19 1, 1-1 

3 aD, 
X Of 1,1-20k—1,1-30k—2, 1-3 Fk—2, 1-28 
t The number Z is, of course, assumed macroscopically large, and edge effects 


(due to the special properties of points near the edges of the lattice) will be neg- 
lected throughout the following discussion. 
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Each line in the diagram is assigned a factor x and each end of each line a 
factor 04). 

The fact that a non-zero contribution to the partition function comes only 
from terms in the polynomial which contain all the o,, in even powers signi- 
fies geometrically that either 2 or 4 bonds must end at each point in the dia- 
gram. Hence the summation is taken only over closed diagrams, which may be 
self -intersecting (as at the point k, /— 1 in Fig. 70b). 


(a) (b) (c) 

e e e e e e e e e e e e e eo e 
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k={,l-f kl-t | k-2,(-1 k-I,(-l 
e ° e e e e re e 

k+{,b-t | k+f,t-i k-2,l-2 “k,l-l k+t,U+t 
e e e e e e e @ e 6 e 
k-1l-2. k,l-2 k-1,l-2 

e e e e e ® e e e 

k-2,l-3 k-1,l-3 
e ® ® e e e 

Fic. 70 
Thus the sum S may be expressed in the form 
S = 2N}' xg, (151.5) 
r 


where g, is the number of closed diagrams formed from an (even) number r 
of bonds, each multiple diagram (e.g. Fig. 70c) being counted as one. 

The subsequent calculation is in two stages: (1) the sum over diagrams of 
this type is converted into one over all possible closed loops, (2) the resulting 
sum is calculated by reducing it to the problem of the “random walk” of a 
point in the lattice. 

We shall regard each diagram as consisting of one or more closed loops. 
For non-self-intersecting diagrams this is obvious; for example, the diagram 
in Fig. 70c consists of two loops. For self-intersecting diagrams, however, the 
resolution into loops is not unique: a given diagram may consist of different 
numbers of loops for different ways of construction. This is illustrated by 
Fig. 71, which shows three ways of representing the diagram in Fig. 70b as 
one or two non-self-intersecting loops or as one self-intersecting loop. Any 
intersection may similarly be traversed in three ways on more complicated 


Pes 
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It is easy to see that the sum (151.5) can be extended to all possible sets 
of loops if, in computing the number of diagrams g,, each diagram is taken 
with the sign (— 1)", where 7 is the total number of self-intersections in the 
loops of a given set, since when this is done all the extra terms in the sum 
necessarily cancel. For example, the three diagrams in Fig. 71 have signs 
+, +, — respectively, so that two of them cancel, leaving a single contribu- 
tion to the sum, as they should. The new sum will also include diagrams with 
“repeated bonds”, of which the simplest example is shown in Fig. 72a. These 
diagrams are not permissible, since some points have an odd number of 
bonds meeting at them, namely three, but in fact they cancel from the sum, as 


4a 


(b) (c) 
Fic. 72 


they should : when the loops corresponding to such a diagram are constructed 
each bond in common can be traversed in two ways, without intersection (as 
in Fig. 72b) and with self-intersection (Fig. 72c); the resulting sets of loops 
appear in the sum with opposite signs, and so cancel. We can also avoid the 
need to take into account explicitly the number of intersections by using the 
geometrical result that the total angle of rotation of the tangent in going 
round a closed plane loop is 2n({/+1), where / is a (positive or negative) 
integer whose parity is the same as that of the number » of self-intersections 
of the loop. Hence, if we assign a factor e** to each point of the loop (with 
the angle of rotation there # = 0, +47), the product of these factors after 
going round the whole loop will be (—1)’+?, and for a set of s loops the 
resultant factor is (—1)"**, wheren = )'v. 

Thus the number of intersections need not be considered if each point on 
the loop is taken with a factor e*® and a further factor (—1)° is taken for the 
whole diagram (set of loops) in order to cancel the same factor in (—1)"t*. 

Let f, denote the sum over single loops of length r (i.e. consisting of 
r bonds), each loop having a factor e¥* at each point on it. Then the sum 
over all pairs of loops with total number of bonds r is 


1 
a,» Sr, m9 


+rgur 


the factor 1/2! takes into account the fact that the same pair of loops is 
obtained when the suffixes r, and r, are interchanged, and similarly for groups 
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of three or more loops. Thus the sum S becomes 


co 1 co 
S= YM OL th Se: 


© Vy Fay oe 


Since S includes sets of loops with every total length r,++r.+°..., the num- 
bers 7, 7, ... in the inner sum take independently all values from 1 to .* 
Hence 


Sooxnte tiff, = (5%) 


Vr onig Vy 


and S becomes 


S = exp (-¥,#). (151.6) 


This completes the first stage of the calculation. 
It is now convenient to assign to each lattice point the four possible direc- 
tions from it and to number them by a quantity v = 1, 2, 3, 4, say, as follows: 


2 


We define as an auxiliary quantity W,(k, /, v) the sum over all possible 
paths of length r from some given point kp, 4, % to a point k, /, v (each bond 
having.as usual the factor e**, where @ is the change of direction to the next 
bond); the final step to the point k, /, y must not be from the point to which 
the arrow » is directed.t With this definition, W, (ko, 4; ¥)-is the sum over 
all loops leaving the point k,, /, in the direction ») and returning to that point. 
It is evident that 


1 
Sf a ope. W,(Kos lo, Vo): (151.7) 


both sides contain the sum over all single loops, but }W, contains each loop 
2r times, since it can be traversed in two opposite directions and can be 
assigned to each of r starting points on it. 


+ Loops with more than N points make no contribution to the sum, since they 
must necessarily contain repeated bonds. 
tT In fact W,(k, 1, v) depends, of course, only on the differences k—kp, [—lp. 
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From the definition of W,(k, /, v) we have the recurrence relations 
W41(k, 1,1) = W,(kK—-1, J, 1) +e7~*#"W,(k, 1—1, 2) 
+0+etW,(k, 1+1, 4), 
W,41(k, 1, 2) = e**W,(k—1, 1, 1)+W,(k, /—1, 2) 
+e-#ay, (k +1, 1, 3)+0, 
W,41(k, 1, 3) = O+et*W,(k, 1-1, 2) 
+W,(k+1, 1, 3)+ e~*#"W, (k, 1+1, 4), 
W 41k, 1, 4) = e~ tw, (k—-1, 1, 1)4+0 
+ et, (k +1, 1, 3)+W,(k, 1+1, 4). 
The method of constructing these relations is evident: for example, the point 
k, I, 1. can be reached by taking the last (r +1)th step from the left, from below 
or from above, but not from the right; the coefficients of W, arise from the 
factors e*?, 


Let A denote the matrix of the coefficients in equations (151. 8) (nth all 
k, 1, written in the form 


Wrarlks h r)= 3 Alklv| ky) Wk, U, ¥). 
| a 4 : 


(151.8) 


The method of constructing these equations enables us to associate with this 
matrix an intuitive picture of a point moving step by step through the lattice 
with a “transition probability” per step from one point to another which is 
equal to the corresponding element of the matrix A; its elements are in fact 
zero except when either k or / changes by +1 and the other remains constant, 
i.e. the point traverses only one bond per step. It is evident that the “proba- 
bility” of traversing a length r will be given by the matrix A’. In particular 
the diagonal elements of this matrix give the “probability” that the poin, 
will return to its original position after traversing a loop of length r, i.e. they 
are equal to W,, (ko, J, »). Hence 


tr Ar = » W, (Ko, lo, Vo). 
Kos los %0 
Comparison with (151.7) shows that 
1 r 1 r 
7; = oe tr A = Op py us 


where the A, are the eigenvalues of the matrix A. Substituting this expression 
in (151.6) and interchanging the order of summation over i and r, we obtain 


1 coal | 
Sen {-3% reat 
= exp {3 by log (1 -xo} 


= [[-v—29. (151.9) 
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The matrix A is easily diagonalised with respect to the suffixes k and / by 
using a Fourier transformation: 


L 
W, (Ps q, ¥) = ae e~2al(pk + g/L, (k, 1, v). (151.10) 


Taking Fourier components on both sides of equations (151.8), we find that 
each equation contains only W,(p, qg, v) with the same p, q, so that the matrix 
A is diagonal with respect to p and q. For given p, g its elements are 


e7P av~le-¢ 0 “Ee? 

ae? E—@ a~lep =O 
A(pqv | pqv’) = 

0 aen?% EP anleg | 

a-le-P 0 aeP e@ 


where « = e%, ¢ = erlL, 
For given p, q a simple calculation shows that 


a (1—xA,) = det (8,7 —xA,y) 


= (142x2)?—2x(1 — x2) (cos ald Oe =). 


Hence, from (151.3) and (151.9), we finally obtain the partition function 


Z = 2M(1—x4)- [] [+e 
P,gm0 
2np 


~2x(1 =x") (608 FP 4-008 a) (451.11) 


The thermodynamic potential ist 


®=-TlogZ 
= —NT log 2+NT log (1—x?) 


1 7 2p 27q 
—— TT 2)2 __ yf 
5 2a log la + x2)? —2x(1 — x?) (cos = +008 = )| 


t In the model under discussion the temperature affects only the ordering of 
dipole orientations, not the distances between dipoles (the “thermal expansion 
coefficient” of the lattice is zero). It is then immaterial whether we consider the 
free energy or the thermodynamic potential. 
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or, changing from summation to integration, 
® = —NT log 2+4NT log (1—-x?) 


Qn 2x 
NT 
~ 2@nyP | | log [(1 +.x2)?—2x(1 — x2) (cos 1+ 008 w2)} dev, dons 
0 


0 


(151.12) 
(remembering that x = tanh (J/T)). 

Let us now examine this expression. The function ®(7) has a singularity 
at the value of x for which the argument of the logarithm in the integrand can 
vanish. As a function of w, and @,, this argument is a minimum for cos @, = 
COS W, = 1, when it equals (1 +x?)?—4x(1 — x?) = (x2+2x—1)%. This ex- 
pression has a minimum value of zero for only one (positive) value of x, 
%, = /2—1; the corresponding temperature T, (tanh (J/T,) = x,) is the 
phase transition point. 

The expansion of P(t) in powers of t = T—T, near the transition point 
includes a singular term as well as the regular part. Here we are interested 
only in the singular term, the regular part being simply replaced by its value 
at t = 0. To find the form of the singular term, we expand the argument of 
the logarithm in (151.12) in powers of w,, @, and ft about the minimum; the 
integral then becomes 


2x 2n 
J f log [c,f* + co(w? + w3)] dw, dw,, 
0 0O 


where c, and c, are constants. Carrying out the integration, we find that the 
thermodynamic potential near the transition point has the form 


® = a—+b(T-T,)* log |T—T,|, (151.13) 
where a and b are further constants (with b > 0). The potential itself is 
continuous at the transition point, but the specific heat becomes infinite in 
accordance with the formula 

C = blog|T—T,I, (151.14) 
which is symmetrical about the transition point. 

In this model, the order parameter is represented by the mean dipole 
moment at the lattice point (the spontaneous polarisation of the lattice), 
which is non-zero below the transition point and zero above it. The tempera- 
ture dependence of this quantity can also be ascertained; near the transition 
point, the order parameter tends to zero according to 


= constant X(T, —T)¥8 (151.15) 
(L. Onsager, 1947).* 


t A comparatively simple method of solving this problem is given by N. V. 
Vdovichenko, Soviet Physics JETP 21, 350, 1965. 
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The correlation function is defined as the mean value of the product of the 
fluctuations of the dipole moment at two lattice points. The correlation 
radius is found to tend to infinity as 1/|T—T,] when T > T,, and at the 
point TJ = T, itself the correlation function decreases with increasing dis- 
tance according to 


(Aon Amn) 0 [(k—m)? + (1—n)?}]-*. 


These results, and those of solving the problem of the properties of the same 
model in an external field, show that the behaviour of the model near the 
phase transition point satisfies the requirements of the scale invariance 
hypothesis. The critical indices have the following values: 


a= 0, B= 4 y=, 6=15, e¢=0, b= iy 
y=l, C=4; (151.16) 


¢ is found from (148.7) with d = 2.t 


§ 152. Van der Waals theory of the critical point 


It has been noted in§ 83 that the critical point of liquid/gas phase transitions 
is a singularity of the thermodynamic functions of the substance. The phys- 
ical nature of this singularity is similar to that which occurs at phase transi- 
tion points of the second kind: in the latter case it is due to the increased 
fluctuations of the order parameter, and in the approach to the critical point 
there are increased fluctuations of the density of the substance. This analogy 
in the physical nature results in a certain analogy in the possible mathemati- 
cal description of the two phenomena, to be discussed in § 153. 

First, however, let us consider as a necessary preliminary the description 
of critical phenomena based on neglecting the fluctuations. In such a theory 
(analogous to the Landau approximation in the theory of phase transitions 
of the second kind) the thermodynamic quantities for the substance as 
functions of the variables V and T are assumed to have no singularities, so 
that they can be expanded in powers of the small changes in these variables. 
The results given in the rest of this section therefore depend only on the 
vanishing of the derivative (OP/OV )z. 

Let us first ascertain the conditions for the substance to be stable when 


(OP/OV)r = 0. (152.1) 


t As regards the critical indices, a logarithmic increase corresponds to a zero 
exponent (see after (148.7)). 


§ 152 Van der Waals Theory of the Critical Point 507 


In deriving the thermodynamic inequalities in § 21, we started from the condi- 
tion (21.1), which led to the inequality (21.2), which is satisfied if the 
conditions (21.3), (21.4) hold. The case (152.1) of interest here corresponds 
to the particular case of the extremum conditions with the equality 
sign in (21.4): | 
| CF CE OE \? 
ast ar (aras) > So) 

The quadratic form in (21.2) may now be either positive or zero, depending 
on the values of 6S and 6V, and so the question whether E—7oS +PoV has a 
minimum requires further investigation. 

We must obviously examine the case where in fact the equality sign occurs 
in (21.2): 

OE 


are 2 __ 
ea Easy 42 OF asar °5 av42 = OY = (152.3) 


Using (152.2), this equation may be written 


2 ‘ 
} (ss iS oe = wv) - 


OE 1 [3 OE]? 
OE/OS? \ OS? 5 oV 


SEOS® t nal == (C,/T) (OTP = 0. 


Thus the equation (152.3) implies that we must consider deviations from 
equilibrium at constant temperature (d7 = 0). 

At constant temperature the original inequality (21.1) becomes 
6F +P 6V > 0. Expanding 6F in powers of 6V and making use of the 
assumption that 0?F/0V? = —(O0P/OV); = 0, we find 


1 /P so fer ; 


If this inequality holds for all 6V, we must havet 
(2P/OV2)7 = 0, (°P/AV%)r < 0. (152.4) 


Let us now consider the equation of state of a substance near the critical 
point. Instead of the variables T and V, it will be more convenient to use 
the variables JT and n, where nis the number of particles per unit volume. 
We shall also use the notation 


t=T-T, p=P-P, yn=n-n. (152.5) 


t It may be noted that the case of equality in (21.3) is impossible in this discus- 
sion, since the condition (21.4) would then be violated. The simultaneous van- 
ishing of the two expressions (21.3) and (21.4) is also impossible: if we add a further 
condition to the vanishing of (OP/6V)7 and (0*P/dV?)z, there result three equations 
in two unknowns, which in general have no common solution. 
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In these variables, the conditions (152.1) and (152.4) become 
(Op/en), = 0, (0*p/dn*), = 0, (0%p/d7),>O at t=0. (152.6) 


Taking only the first terms in the expansion in powers of small ¢ and y, we 
write the dependence of pressure on temperature and density in the form 


p = bt+2atn+4Br? (152.7) 


with constant a, b and B. There are no terms in 7 and 7? in this expansion, 
from the first two conditions (152.6), and from the third condition B > 0. 
When ¢ > 0, all states of a homogeneous body are stable (there is nowhere a 
separation into phases), and we must have (dp/67), > 0 for all 7, whence 
a > 0. The terms in fy? and fn may be omitted, since they are certainly small 
in comparison with that in 77; the term in ry itself must be retained, since it 
contributes to the derivative 
(Op/On); = 2at+12Br? ~ (152.8) 
needed below. 
The expression (152.7) determines the isotherms of a homogeneous sub- 
stance near the critical point (Fig. 73). These have a form similar to the van der 
Waals isotherms (Fig. 19, § 84). When ¢ < 0, they have a minimum and a 


maximum, and a horizontal section (AD on the bottom isotherm) corre- 
sponding to the condition (84.2) gives the equilibrium transition from liquid 
to gas. Taking V in this condition to be the molecular volume 


_1 1 17) 
v= oe (152.9) 
we can write the condition as 
D Nz 
ain (32) dn = 0. (152.10) 
on |, 
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After substitution of (152.8), this condition gives the following values for 
the densities of the two phases in equilibrium: 


M1 = —N2 = V/(—at/2B). (152.11) 


The densities 4, and 7, corresponding to the limits of the metastable 
regions (points B and C in Fig. 73) are given by the condition (dp/07),, 
whence’ 

ni =—n, = /(—at/6B). (152.12) 


Substitution of (152.11) makes the sum of the last two terms in (152.7) 


zero. Hence 
p= bt (t < 0) (152.13) 


is the equation of the curve of equilibrium of liquid and vapour in the pt- 
plane (and therefore b > 0).+ According to the Clapeyron—Clausius jequa- 
tion (82.2), the heat of evaporation near the critical point is 


q = bT,.(,—N)/ne. (152.14) 


It then follows from (152.11) that as ¢ - 0 this heat tends to zero according 


to 
qc /f/—-t. (152.15) 


From (16.10) it follows that the specific heat C, becomes infinite at the 
critical point while (Ap/0n), tends to zero. From (152.8) we find 


C, « 1/(at +6Br?). (152.16) 


In particular, for states on the equilibrium curve, 7 oc »/—1?, and therefore 
Cc, ~ 1-4. 

Lastly, let us consider in this theory the density fluctuations near the criti- 
cal point. The necessary formulae have already been derived in§ 116, and to 
apply them we need only establish the specific form of 4F,, the change in the 
total free energy of the body in a deviation from equilibrium. 

We write 

AF, = |(F—F) dv, 


where F is the free energy per unit volume and F its mean value, which is 
constant through the body, and expand F—F in powers of the density 
fluctuation 4n = n—ZA (or, equivalently, 4n = 7—#) at constant temperature. 
The first term in the expansion is proportional to 4n, and on integration 


t In a theory which takes account of the singularity of the thermodynamic quan- 
tities at the boundary of the metastable states, there is no curve BC. 

t When ¢ = 0, equation (152.13) determines the critical isochore, the curve of 
constant density (7 = 0) passing through the critical point. 
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over the volume it vanishes, since the total number of particles in the body 
remains unchanged. The second term is* 


POR oe Le POP eine 
x (ar) 2°) = a (an) rv. 


Together with this term, which vanishes at the critical point itself, we must 
also include another term quadratic in 4n, due to the inhomogeneity of a 
body with fluctuating density. Without repeating here the arguments given 
in § 146, we can state at once that this term is quadratic in the first derivatives 
of dn with respect to the coordinates; in an isotropic medium, such a term 
must be the square of the gradient. Thus we arrive at an expression of the 


form+ 
= 1 (Op 2, (Oo 4An\? 
AF, = hes (sr) (an +83 dV. (152.17) 


Now expressing An as a Fourier series (116.9), we can bring this to the 
form (116.10), with the function . 


eee 2 
10) = 5. (Gn) +28 


= 2 (at + 6By7") + 2gk?, 
c 


and then, using (116.14), find the Fourier transform of the required correla- 


tion function: 
(k) = 2T fat + 6Br? + gnek?]-! ; (152.18) 


since the denominator here is small, the term 1 in »(k) may be neglected. 
This formula is entirely analogous to (146.8). The correlation function »(r) 
in the coordinate representation therefore has the same form (146.11), with 
the correlation radius 

re = [gne/(at + Br)”. (152.19) 


In particular, r, oc t—*” on the critical isochore (7 = 0). 


t Since the free energy F is referred to a fixed (unit) volume of substance, not 
to a fixed number of particles in it, (OF /On)r = ps. The second derivative is 


CED (CH ae tf OR 
(at), ~ (an), ~ > (ae), 


since, for T= constant, du = v dP, where v = 1/n is the molecular volume. 


+ The fact that AF, is expressed as an integral of afunction of a point in the body 
(not of two points as in the general expression (116.8)) results from the assumption 
that dn varies slowly; the long-wavelength components of the density fluctuations 
are considered. 
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§ 153. Fluctuation theory of the critical point 


The formulae derived in § 152 enable us to establish a certain analogy be- 
tween the thermodynamic description of the properties of a substance near 
the critical point and near phase transition points of the second kind. 

To do so, we proceed in the spirit of the Landau theory by first regarding 
7 not as a definite function of P and T but as an independent variable whose 
equilibrium value is given by minimising a thermodynamic potential 
@(P, T, 7). The latter is to be chosen so that this minimisation in fact gives 
the correct. equation (152.7). The requirement is met by the expressiont 


N ete 
@(P, T, 1) = PoP, T) + [—(p — bt)y 4 atr? + Bn*}. (153.1) 


Comparing (153.1) and (144.3), we now see that there is an analogy be- 
tween the description of a phase transition of the second kind in an external 
field in the Landau theory and the description of a critical point between a 
liquid and a gas in the van der Waals theory. In the latter case, the order 
parameter is represented by the change in density of the substance, 7 = 
n—n,, and the external field by the difference 


h = p—bt. (153.2) 


If O(t, h) is the thermodynamic potential of the body near the phase transi- 
tion point of the second kind (for some fixed value of the pressure), the 
expression @(t, p—bt) gives the form of the thermodynamic potential near 
the critical point. The whole of the discussion in § 146 about the change 
from the potential ® to the potential 2 is valid for any case, and so the analo- 
gy applies also to the potentials 2 in the two problems. 

It has been shown in§ 147 how we can go from the thermodynamic poten- 
tial Q in the Landau theory to the effective Hamiltonian which describes 
the phase transition in exact fluctuation theory. The analogy mentioned 
therefore leads us to expect that the laws of behaviour of the thermody- 
namic quantities near the critical point are the same (with the appropriate 
change of significance of 7 and A) as the limiting laws in the fluctuation region 
of the phase transition of the second kind in an external field (which is 
described by only one order parameter). 

It must be emphasised immediately that this identification can only be 
approximate. In the theory of phase transitions based on the effective 


t The (here unimportant) coefficient of the square bracket is chosen so that after 
minimisation the expression (153.1) gives the correct potential O(P, 7). It may seem 
strange that (153.1) is not symmetrical in p and 1, in that the coefficient of 77 does 
not involve p. In reality, the 7? term is significant only if the coefficient p—bt of 
7 is small, and in that case we can equally well write atn* or apy*/b. 
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Hamiltonian (147.6), there is an exact symmetry with respect to the trans- 
formation h + —h, 1 + —7 (because of the exact vanishing of the third- 
order term ~ 7°). In critical-point theory, this symmetry is only approx- 
imate; the absence from (153.1) (and therefore from the effective Hamiltonian) 
of terms which violate this symmetry is due only to neglecting them as small 
in comparison with the remaining terms. We can therefore assert only that 
the principal terms in the limiting relationships in the two cases must be 
the same.* 

In the theory of phase transitions with ¢ > 0 and h = 0, we have 7 = 0; 
with ¢ < 0 and h - 0, two phases are in equilibrium which have non-zero 
values 71 and 72 of the order parameter, where 71 = — 72 (the points A and 
A’ in Fig. 64b, § 144); this equation is an exact consequence of the above- 
mentioned symmetry of the effective Hamiltonian. For a critical point, the 
corresponding equation is 


p—bt = 0, (153.3) 


which determines the critical isochore (7 = 0, i.e. n = n,) for t > 0, and the 
liquid/vapour equilibrium line for t < 0. The equation 72 = —71 here signi- 
fies the symmetry of the phase equilibrium line in the ty-plane, and an exten- 
sion of the analogy shows that these values tend to zero as t + 0 according 


to 
71 =—m2 oc (—1F, (153.4) 


with the same exponent as in (148.5).+ But, since the invariance of the 
effective Hamiltonian under a change in the sign of 7 (when h = 0) is only 
approximate, the question arises of the limiting temperature dependence 
of the sum 71+72. From the foregoing discussion we can say only that it is 
of a higher order of smallness than 71 and 72 themselves; we shall return 
to this at the end of the section. 

Figure 74 shows the phase diagram in the f-plane. The region of separa- 
tion into two phases is hatched, and its boundary is a symmetrical curve, in 
accordance with (153.4). 


t The analogy described must not, of course, conceal the physical difference 
between the two phenomena: for a phase transition of the second kind, we have 
a whole curve of transition points, which separates (in the PT-plane) the regions of 
existence of two phases with different symmetry. A critical point, however, is an 
isolated point (at the end of an equilibrium curve) in the phase diagram of two 
phases having the same symmetry. 

t Here and in the rest of § 153, when referring to the critical indices for transi- 
tions of the second kind, we mean specifically the values of these indices for tran- 
sitions described by only one order parameter, with an effective Hamiltonian of 
the form (147.6). The van der Waals theory of the critical point corresponds to 
the values of the indices given for the Landau theory in the footnote to (148.11). 
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The heat .of evaporation is related to the difference 71—1n2 by (152.14). 
It therefore tends to zero as |t| -> 0, by the same law 


q ox (—t. (153.5) 


The general equation of state of a homogeneous substance throughout the 
neighbourhood of the critical point (in the 77-plane) may be written 


p—bt ==+!n!? f(t/|n|""), (153.6) 


where the two signs correspond to 7 > 0 and 7 < 0 (B. Widom, 1965). 
This formula corresponds to equation (148.18) in the theory of phase tran- 
sitions (when solved for /). 

The same considerations of analyticity apply to f(x) in (153.6) as were 
discussed in § 149 for transitions of the second kind. 

For example, with a given non-zero value of 7, a change in the sign of 
t nowhere causes a passage through the critical point; hence t = 0 is not a 
singularity of (153.6). This function may therefore be expanded in integral 
powers of ¢. Thus f(x) is expanded in integral powers of x. The first two terms 
of this expansion are 1+c,x, so that the equation of state becomes 


poe cin! [ter ag + | for |tl<|n|M; (153.7) 


the first term in the expansion corresponds to the definition (148.10) for a 
strong field in the theory of phase transitions. In Fig. 74 the dotted lines 
show diagrammatically the boundaries of the region to which this equation 
of state applies. In this region, two further limiting cases may be distin- 
guished. If ¢<« p (in particular, on the critical isotherm, i.e. on the line 

= 0), then 
pot’. (153.8) 


If ¢ > p (in particular, on the critical isobar, i.e. on the line p = 0), then 
toct/n/é. (153.9) 
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A comparison of (153.8) and (153.9) shows that there is symmetry between 
p and t, as there should be.* 

Similarly, for a given non-zero value of 1, the value n = Ois not a singu- 
larity. Hence, for t > 0 and 7 —~ 0, the function (153.6) can be expanded in 
integral powers of 7, and the expansion can contain only odd powers, again 
because of the symmetry of the effective Hamiltonian under a simultaneous 
change of sign of 7 and A. From this it follows that? 


F(x) oc xP(eyx-F + cgx-8 +...) for x +005 


the factor x’ cancels the non-integral power 17’, and the expansion variable 
x? oc ». Thus the equation of state becomes 


p—bt oc Plemt+csrpt-7F + 2...) for to |n|¥F, (153.10) 


where we have used the equation 86 = 8 +y, (148.14). The region where this 
equation is valid is also shown diagrammatically in Fig. 74. The first term of 
the expansion (153.10) corresponds to the relation 7 = yh oc ht~” in the 
theory of phase transitions in a weak field. 

The behaviour of the derivatives of p of various orders with respect to 7 
(with ¢ constant) depends on the direction in the 7t-plane along which the 
critical point is approached. If the approach is along the critical isotherm 
(t¢ = 0), the function p(y) is given by (153.8). In practice, the value of 6 lies 
between 4 and 5. Hence not only (0p/07), but the derivatives of several 
higher orders tend to zero along the critical isotherm. 

If the critical point is approached in any other direction (lying outside 
the region of separation into two phases, i.e. along a radial line t = constant X 
In| with the constant positive), the inequality t>> ||" holds, since in 
practice 1/8 > 1. The equation of state then gives 


(Op/On); oc ” + 0, 


and the second derivative is 
(©p/0n?), oc nt?-*6 = pr-Py/t. 


The factor 7/t” « 1, and f’~? - 0, since in practice y > 8. Thus the deriva- 
tive (0?p/07?), also tends to zero. 


t For t o 7°, the argument of f(x) in (153.6) is x oc t/f/® << 1, since in practice 
fd = B+y > 1. This proves that the case ¢ >> p is in fact possible in the equation 
of state (153.7). 

+ The case x -» — co is not realistic, since values of |n{¥* < |r| with ¢ < 0 are 
in the region of separation. 
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The behaviour of the specific heat of the substance in the critical region 
can be ascertained from the expression for the thermodynamic potential, 


P(p, t) = |h?-* h(t/|h|O*), hb = p—bt, (153.11) 


which is written down immediately by analogy with (149.7) in the theory 

of phase transitions, with the identical substitution of indices dy = 2—a, 

p/v = 1/(8+y). Without repeating the whole argument, we can put down 

at once, by analogy with (149.9) and (149.10), the necessary limiting expres- 
sions :f 

O(p,Hecf-- for t>0, h-O, (153.12) 

D(p, t) oc (—1)?-* [1 +e1|h|/(—o%+7] for t<0, A+>O. (153.13) 


By twice differentiating (153.12), we find the specific heat on the critical 
isochore p—bt = 0, t > 0: 
Creer, (153.14) 


Since differentiation with h = 0 and ¢ > 0 signifies differentiation with 7 =0, 

this is the specific heat at constant volume. Thus the specific heat C, on the 

critical isochore behaves like C, in a phase transition of the second kind. 
According to (16.10), 


(Op/00); 
Cy—-C a, 
me len 
As the critical point is approached, the derivative (6p/6#), tends to a con- 
stant limit b, as is easily seen from the equation of state (153.7) or (153.10). 
Hence 
C, < (Op/dn);?. (153.15) 


The divergence of this expression as the critical point is approached is 
stronger than that of C,; the term C,, has therefore been omitted in compari- 
son with C,,. 

Lastly, let us consider the asymmetry of the curve of coexistence of phases 
near the critical point (V. L. Pokrovskil, 1972). As already mentioned, this 
asymmetry can occur only when terms are taken into account in the effective 
Hamiltonian which make it no longer symmetrical under the transformation 


t It may be recalled that ® here denotes (as in § 149) the singular part of the 
thermodynamic potential. While it is a small correction to the principal (non- 
singular) part, it also gives a similar correction to the other thermodynamic 
potentials. On the phase equilibrium curve, the characteristic magnitude of this 
correction is oc 22~*, a point that will be made use of in § 154. 
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h + —h, y +—n. The first such term ist ~ 72h; its occurrence may be 
formally regarded as resulting from the replacement of ¢ in the effective 
Hamiltonian by ¢+constant xh, so that 


ar?t -- ar?(t-+constantXh). 


This change in the effective Hamiltonian causes a similar change in the 
thermodynamic potential expressed as a function of h and ft: 


@(h, t) ~ Och, t+constant xh). 


Near the curve of coexistence of phases, the function (A, £) is given by 
(153.13); the required density is found by differentiating with respect to h. 
The result is 


[Vnln-++0 = (—OO/dh), 40 
oc Fe1(—t)? + (2—«) constant x (—f)-*. 


The first term gives the known values (153.4) of the densities on the symmetri- 
cal coexistence curve; this term disappears in the sum 71+72, leaving 


nity © (—fi-*. (153.16) 


which gives the required form. In practice 1—« > £, so that the asymmetry 
is in fact relatively slight: (q1-+12)/n1 ~ 0 as t + 0. The sum 71+72 is in 
practice positive; this means that, when it is taken into account, the shape 
of the coexistence curve is changed in the manner shown in Fig. 75. 


‘| 


t The inclusion in the effective Hamiltonian of a term ~ 7° would not destroy 
the symmetry, since such a term could be eliminated by simply applying a trans- 
formation 7 -» 7+constant X t. In this connection it may be recalled (cf. the foot- 
note following (149.7)) that 7 in the effective Hamiltonian is only a continuum 
integration variable, and therefore the transformation mentioned does not affect 
the integral over states. 
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SURFACES 


§ 154. Surface tension 


HiTHERTO we have entirely neglected effects resulting from the presence of 
surfaces of separation between different bodies.‘ Since, as the size of a body 
(i.e. the number of particles in it) increases, surface effects increase much 
more slowly than volume effects, the neglect of surface effects in the study of 
volume properties of bodies is entirely justified. There are, however, a num- 
ber of phenomena which depend in fact on the properties of surfaces of 
separation. 

The thermodynamic properties of such an interface are entirely described 
by one quantity, a function of the state of the bodies, defined as follows. We 
denote by 8 the area of the interface, and consider a process whereby this 
area undergoes a reversible change by an infinitesimal amount d8. The work 
done in such a process is obviously proportional to d8, and so can be 


written as 
dR = «a d8. (154.1) 


The quantity « thus defined is a fundamental characteristic of the interface, 
and is called the surface-tension coefficient. 

Formula (154.1) is exactly analogous to the formula dR = —P dV for the 
work done in a reversible change in the volume of a body. We may say that « 
plays the same part in relation to the surface as the pressure does in relation 
to the volume. In particular, we can easily show that the force on unit length 
of the perimeter of any part of the interface is equal in magnitude to « and is 
directed tangentially to the surface and along the inward normal to the 
perimeter. 

Here we have assumed that « is positive. The fact that it must indeed always 
be positive is shown by the following argument. If « < 0, the contour 
bounding the surface would be subject to forces along the outward normal, 
i.e. tending to “stretch” the surface; the interface between two phases would 


t In reality, of course, phases in contact are separated by a thin transition layer, 
but the structure of this is of no interest here, and we may regard it as a geometrical 
surface. 
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therefore tend to increase without limit, and the phases would mix and cease 
to exist separately. If « > 0, on the other hand, the interface tends to become 
_ as small as possible (for a given volume of the two phases). Hence, for 
example, if one isotropic phase is surrounded by another, it will take the 
form of a sphere (the effect of an external field, e.g. gravity, being neglected, 
of course). 

Let us now consider in more detail the surface tension at the interface 
between two isotropic phases, liquid and vapour, of the same substance. 
If an interface between two phases in equilibrium is concerned, it must be 
remembered that their pressure and temperature are in a definite functional 
relation given by the equation of the phase equilibrium curve, and « is then 
essentially a function of only one independent variable, not of two. 

When surface effects are neglected, the differential of the energy of a sys- 
tem of two phases (of the same substance), for a given total volume V of the 
system, is dE = 7 dS +p dN; in equilibrium, the temperatures T and chem- 
ical potentials u of the two phases are equal, and this equation can therefore 
be written for the whole system. When the presence of surface effects is taken 
into account, the right-hand side of the equation must clearly include also 
the expression (154.1): 


dE = TdS+udN+ea dB. (154.2) 


It is, however, more convenient to take as the fundamental thermodynamic 
quantity not the energy but the potential 2, the thermodynamic potential in 
terms of the independent variables T and yw (and the volume V’). The conven- 
ience of 2 in this case arises because T and w are quantities which have 
equal values in the two phases, whereas the pressures are not in general equal 
when surface effects are taken into account; see § 156. For the differential 
dQ, again with V = constant, we have 


dQ =—S dT—N du+a dB. (154.3) 


The thermodynamic quantities (such as E, 2 and S) for the system under 
consideration can be written as the sum of a “volume” part and a “surface” 
part. This division, however, is not unique, since the number of particles in 
each phase is indeterminate to the extent of the number of particles in the 
transition layer between the phases; the same is true of the volumes of the 
phases. This indeterminacy is of just the same order of magnitude as the 
surface effects with which we are concerned. The division can be made unique 
if the following reasonable condition is imposed: the volumes V1 and V2 of 
the two phases are defined so that, in addition to the equation Vi+V2 = V, 
where V is the total volume of the system, the equation 11V1+n2V. = N is 
satisfied, where N is the total number of particles in the system, and m, = 
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ni(u, T) and nz = na(u, 7) are the numbers of particles per unit volume in 
each phase (the phases being regarded as unbounded). 

These two equations determine the choice of the volumes V1 and V2 (and 
the numbers of particles Ny = nyV1, No = n2V2), and hence also the volume 
parts of all other thermodynamic quantities. We shall denote volume parts 
by the suffix 0, and surface parts by the suffix s; by definition, N, = 0. 

From (154.3) we have, for constant J and p (and therefore constant «), 
dQ = « d8; it is therefore evident that Q, = «8, and so 


Q = Qo4+a8. (154.4) 
Since the entropy S = —(62/0T), , the surface part of it is' 
S, = —02,/8T = —3 da/dT. (154.5) 
Next, let us find the surface free energy; since F = 2+Nuy and N, = 0, 
F, = a8. (154.6) 
The surface energy is 
E, = F,+TS, = (a—T da/d7)8. (154.7) 


The quantity of heat absorbed in a reversible isothermal change of surface 
area from 8 to 3 is 


O = T(Ss2—S1) = —T(da/dT) (82-81). (154.8) 


The sum of the heat Q and the work R = «(32— 33) in this process is equal to 
the change in energy E,,—E,,, as it should be. 

The law of corresponding states (§ 84) can be qualitatively applied to the 
surface tension between a liquid and its vapour. From this law we should 
expect that the dimensionless ratio of « to a quantity with dimensions erg/cm? 
formed from the critical temperature and critical pressure will be a universal 
function of the reduced temperature 7/T,: 


a/(T Pe)? = f(T/T.); (154.9) 


at temperatures well below the critical value, this ratio is approximately 4. 

At a critical point, the liquid and gas phases become identical, the inter- 
face between them ceases to exist, and the surface tension coefficient must be- 
come zero. Using the ideas of the fluctuation theory of the critical point, 
we can express the manner of its tending to zero in terms of the critical 
indices defined in § 148. 


+ The coefficient « is a function of only one independent variable: for such a 
function the partial derivatives with respect to u and T have no meaning. But, by 
putting N, = —(02,/Ou); = 0, we have formally assumed that (Ga/Ou)7 = 0; in 
this case we clearly have da/dT = (0x/6T),, and this has been used in (154.5). 
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As the critical point is approached, the width of the transition layer be- 
tween the two phases increases and becomes of macroscopic size. Suffi- 
ciently close to the critical point, it must be of the order of the correlation 
radius r, of the fluctuations. In order to find the surface tension, we need 
only now multiply the width 7, by the characteristic thermodynamic potential 
density, or rather its singular part, which determines the critical phenomena. 
This density is oc (~—1)’~*; see the footnote to (153.12).t Thus the surface 
tension coefficient is o r,(—#)*~*; since r, oc (—f)~” from (148.6), and 
2—a = vd = 3v from (149.2), we have finally 


o oc (T,—-T)” (154.10) 
(B. Widom, 1965). In practice, 2v = 1.3.4 


§ 155. Surface tension of crystals 


The surface tension of an anisotropic body, a crystal, is different at different 
faces; it may be said to depend on the direction of the face, i.e. on its Miller 
indices. The form of this dependence is somewhat unusual. Firstly, the 
difference in the values of « for two crystal planes with arbitrarily close direc- 
tions is itself arbitrarily small, i.e. the surface tension can be represented as a 
continuous function of the direction of the face. It can nevertheless be shown 
that this function nowhere has a definite derivative. For example, if we consid- 
er a set of crystal planes intersecting along a common line, and denote by ¢ 
the angle of rotation around this line, which defines the direction of the plane, 
we find that the function « = «(@) has two different derivatives for every 
value of ¢, one for increasing and the other for decreasing values of the argu- 
ment.§ 

Let us suppose that the surface tension is a known function of the direc- 
tion of the faces. The question arises how this function can be used to deter- 
mine the equilibrium form of the crystal. (It must be emphasised that the 
crystal shape observed under ordinary conditions is determined by the condi- 
tions of growth of the crystal and is not the equilibrium shape.) The equilib- 


t The use of the same letter for the surface tension and the critical index « is 
unlikely to cause any misunderstanding. 


t The width of the transition layer is equal to the correlation radius in the van 
der Waals theory also, and therefore the form oc (—1)?~*~” for the surface tension 
(without the use of (149.2)) would be valid in that theory. With « = 0, y= 4, 
the result would be (— 29/2. 

§ This is discussed in more detail by L. D. Landau, Shornik v chest’? 70-letiya 
A, F. loffe, p. 44, Moscow, 1950; translation in Collected Papers of L. D. Landau, 
p. 540, Pergamon, Oxford, 1965. 
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rium form is determined by the condition for the free energy F to be a mini- 
mum (for given T, and volume V of the crystal) or, what is the same thing, 
by the condition for its surface part to be a minimum. The latter is 


i.= po d8, 


the integral being taken over the whole surface of the crystal; for an isotropic 
body « = constant, F, = «8, and the equilibrium form is determined simply 
by the condition for the total area 3 to be a minimum, L.e. it is a sphere. 

Let z = 2(x, y) be the equation of the surface of the crystal, and let p = 
02z/0x, g = 0z/dy denote the derivatives which determine the direction of 
the surface at each point; « can be expressed as a function of these, « = «(p, q). 
The equilibrium form is given by the condition 


fx(p, 9) /( +p? +9") dx dy = minimum (155.1) 
with the added condition of constant volume 
J z.dx dy = constant. (155.2) 
This variational problem leads to the differential equation 
2 ait = Ze 22, (155.3) 
where 
F(a, 9) = ap, 2/1. +P? +9") (155.4) 


and A is a constant. 
Next, we have by definition dz = p dx +gq dy; with the auxiliary function 


€ = px+qy—z, (155.5) 
we find d¢ = x dp+y dg or 
x= 0C/Op, y = 0¢/dg, (155.6) 


¢ being here regarded as a function of p and qg. Writing the derivatives 
with respect to x and y in (155.3) as Jacobians, multiplying both sides by 
O(x, y)/O(p, g) and using (155.6), we obtain the equation 


O(ef/ep, 0¢/0g) | O(0C/Op, Of/2q) _ >A 0(8C/Op, 0¢/0q) 
O(p, 9) (Pp, 9) a(p, 9) 


This equation has an integral f= AC = A(px+qy—z), or 


Boss z(P stage ) (155.7) 
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This is just the equation of the envelope of the family of planes 


px+qy—z = a(p, g)/(. +p? +9")/A, (155.8) 


where p and q act as parameters. 

This result can be expressed in terms of the following geometrical con- 
struction. On each radius vector from the origin we mark off a segment ot 
length proportional to «(p, g), where p and q correspond to the direction of 
that radius vector. A plane is drawn through the end of each segment af 
right angles to the radius vector; then the envelope of these planes gives the 
equilibrium form of the crystal (G. V. Vul’f). 

It can be shown} that the unusual behaviour of the function « mentioned 
at the beginning of this section may have the result that the equilibrium form 
of the crystal determined by this procedure will include a number of plane 
areas corresponding to crystal planes with small values of the Miller indices. 
The size of the plane areas rapidly decreases as the Miller indices increase. 
In practice this means that the equilibrium shape will consist of a small 
number of plane areas which are joined by rounded regions instead of inter- 
secting at sharp angles. 


§ 156. Surface pressure 


The condition that the pressures of two phases in contact are equal has 
been derived (in § 12) from the equality of the forces exerted on the interface 
by the two phases; as elsewhere, surface effects were neglected. It is clear, 
however, that if the interface is not plane a displacement of it will in general 
change its area and therefore the surface energy. In other words, the exist- 
ence of a curved interface between the phases leads to additional forces, 
as a result of which the pressures of the two phases will not be equal. The 
difference between them is called the surface pressure. 

This quantity is determined by the condition of mechanical equilibrium: 
the sum of the forces acting on each phase at the interface should be zero. 
This sum is in turn given by the derivative of a thermodynamic potential with 
respect to the displacement of the interface, the other variables correspond- 
ing to this potential remaining constant; cf. (11.3) and (15.11). In particular, 
in differentiating the potential 2 we must regard P, u and the total volume of 
the system as constants. 

Let us consider two isotropic phases (two liquids, or a liquid and a va- 
pour). We shall assume that one of the phases (phase 1) is a sphere (of ra- 


* The direction cosines of the radius vector are proportonal to p, g, —1. 
t See the paper quoted in the first footnote to this section. 
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dius r) surrounded by the other phase. Then the pressure is constant within 
each phase, and the total thermodynamic potential 2 of the system is 


Q = — PV y—PeVo+08, (156.1) 


the first two terms forming the volume part of the potential; the suffixes 1 
and 2 refer to the two phases. 

The pressures of the two phases satisfy the equations 41(P1, 7) = w2(Pe, T) 
= py, where pw is the common value of the two chemical potentials. Hence, 
for constant » and 7, we must regard P, and P., as constant also, and 
likewise the surface-tension coefficient «. Since V = V1+V¢ is constant, we 
find as the condition of mechanical equilibrium 


(Fr) =— (P,P) Th aS. 


Finally, substituting V; = 4zr3/3, 3 = 4ar?, we obtain the required formula: 
P\—P2 = 2a/r. (156.2) 


For a plane interface (r + oo) the two pressures are equal, as we should 
expect. 

Formula (156.2) determines only the difference between the pressures in 
the two phases. We shall now calculate each of them separately. 

The pressures Pi and Pz satisfy the equation u1(P1, 7) = u2( Pe, T). The 
common pressure Pp in the two phases when the interface is plane is deter- 
mined at the same temperature by the relation u1(Po, 7) = pa(Po, T). Sub- 
traction of these two equations gives 


u(P1, T)— a(Po, T) = (Pa, T)—2( Po, T). (156.3) 


Assuming that the differences 6P; = Pi—Po, 6P2 = P2—Po are relatively 
small and expanding the two sides of equation (156.3) in terms of 6P1 and 
OP2, we find 

V1 OP, = Va 6P2, (156.4) 


where v, and v2 are the molecular volumes (see (24.12)). Combining this with 
formula (156.2) written in the form 6P1—6P2 = 2«/r, we find the required 
OP, and 6P, as 


spp =. Spy = (156.5) 


For a drop of liquid in a vapour, v1 « Ug; regarding the vapour as an 
ideal gas, we have U2 = T/P2 = T/Po, and so 


OP; = 2a/r, bP, = 2vuePo/rT, (156.6) 
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where for clarity the suffixes / and g are used in place of 1 and 2. Thus we see 
that the vapour pressure over the drop is greater than the saturated vapour 
pressure over a plane liquid surface, and increases with decreasing radius of 
the drop. 

When the drop is sufficiently small and 6P,/P) is no longer small, formulae 
(156.6) become invalid, since the large variation of the vapour volume with 
pressure means that the expansion used to derive (156.4) from (156.3) is no 
longer permissible. For a liquid, whose compressibility is small, the effect of 
a change of pressure is slight, and the left-hand side of (156.3) can again be 
replaced by v, 6P,. On the right-hand side we substitute the chemical poten- 
tial of the vapour in the form wu = T log P,+4(T), obtaining 


OP; = P,—Po = (T/v;) log (P,/Po). 


Since in this case 6P, >> OP,, the difference P,—P, can be replaced by P,—P,; 
using formula (156.2) for the surface pressure, we then have finally 


log (P,/Po) = 2av,/rT. (156.7) 


For a bubble of vapour in a liquid we similarly obtain the same formulae 
(156.6), (156.7) but with the opposite signs. 


§ 157. Surface tension of solutions 


Let us now consider an interface between a liquid solution and a gas phase 
(a gas and a solution of it in a liquid, a liquid solution and its vapour, etc.). 

As in § 154, we divide all thermodynamic quantities for the system under 
consideration into volume and surface parts, the manner of division being 
determined by the conditions V = Vi+V2,N = N1+N2 for the volume and 
number of solvent particles. That is, the total volume V of the system is divid- 
ed between the two phases in such a way that, on multiplying V; and V2 by 
the corresponding numbers of solvent particles per unit volume, and adding, 
we obtain just the total number N of solvent particles in the system. Thus by 
definition the surface part N, = 0. 

As well as other quantities, the number of solute particles will also be writ- 
ten as a sum of two parts, n = ny-+tn,. We may say that n, is the quantity of 
solute which would be contained in the volumes V1 and V2 if it were distrib- 
uted in each with a constant concentration equal to the volume concentra- 
tion of the corresponding solution. The number mo thus defined may be either 
greater or less than the actual total number 7 of solute particles. If n, = 
N—No => O, this means that the solute accumulates at a higher concentra- 
tion in the surface layer (called positive adsorption). Ifn, < 0, the concentra- 
tion in the surface layer is less than in the volume (negative adsorption). 
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The surface-tension coefficient of the solution is a function of two inde- 
pendent variables, not one. Since the derivative of the potential 2 with 
respect to the chemical potential is minus the corresponding number of par- 
ticles, n, can be found by differentiating 2, = «8 with respect to the chem- 
ical potential y’ of the solute: 


ns = —0Q,/Ou’ = —38(0a/Oy’)r. (157.1) 


Let us assume that the pressure of the gas phase is so small that its effect 
on the properties of the liquid phase may be neglected. Then the derivative 
of « in formula (157.1), which must be taken along the phase equilibrium 
curve at the temperature concerned, can be replaced by the derivative at 
constant (viz. zero) pressure (and constant 7). Regarding « as a function of 
the temperature and the concentration c of the solution, we can rewrite for- 
mula (157.1) as 

N, = —3(Oau/Oc)zr (Oc/Op’)r, Pp. (157.2) 


According to the thermodynamic inequality (96.7), the derivative (Oy’/Oc)r, p 
is always positive. Hence it follows from (157.2) that n, and (G«/dc); have 
opposite signs. This means that, if the solute raises the surface tension 
(« increases with increasing concentration of the solution), it is negatively 
adsorbed. Substances which lower the surface tension are positively adsorbed. 

If the solution is a weak one, the chemical potential of the solute is of the 
form yp’ = T log c+y(P, T); substituting this in (157.2), we find 


Ns = —3(c/T) (Oa/Oc)r. (157.3) 
A similar formula, 
nN, = —3(P/T) (0«/OP)r, (157.4) 


is obtained for the adsorption of a gas (at pressure P) by a liquid surface. 

If not only the solution but also the adsorption from it is weak, « can be 
expanded in powers of c; we have approximately « = ao +a1c, where ao is 
the surface tension at the interface between two phases of pure solvent. Then 
we have from (157.3) a, = —n,7/8c, and hence 


“a—ao = —n,T/8. (157.5) 


The resemblance between this formula and van’t Hoff’s formula for the os- 
motic pressure should be noticed (the volume being here replaced by the 
surface area). 


t The coefficient « is now a function of two independent variables, e.g. ys’ 
and 7; the derivative 02, /Op’ must be taken at constant 7 and chemical potential 
pu of the solvent. The condition N, = —(02,/O),, 7 = 0 used here implies that 
we formally take (0a /C),, 7 = Oand therefore we can write equation (157.1) (cf. 
the second footnote to § 154). 


526 Surfaces 


§ 158. Surface tension of solutions of strong electrolytes 


The change in surface tension of a liquid when a strong electrolyte is dis- 
solved in it can be calculated in a general form for weak solutions (L. Onsager 
and N. N. T. Samaras, 1934). 

Let w,(x) denote the additional energy of an ion (of the ath kind) because 
of the free surface at a distance x from the ion (w,(x) tending to zero as 
x +oo). The ion concentration near the surface differs from its value c, 
within the solution by a factor e~”/? = 1—w,/T. The contribution of the 
surface to the total number of these ions in the liquid is therefore 


Nas= — Ba | Wa dx, (158.1) 


where v is the molecular volume of the solvent. 
To calculate the surface tension, we begin from the relation 


3 da =—Y tas du’, (158.2) 


where the summation is over all the kinds of ion in the solution. For weak 
solutions (u, = T log c,+y,), 


3de=—-TY, ve deq. (158.3) 


a 


Substitution of (158.1) gives 
fee >) aes [ We dx. (158.4) 


4 


0 


It will be seen from the subsequent results that the main contribution to 
the integral comes from distances x which are large compared with the 
distances between molecules but small compared with the Debye length 1/x. 

The energy w, consists of two parts: 


ee 2 yd 
w= ap S78 + e_(2) (158.5) 
The first term arises from the “image force” on a charge ez, in a medium 
with dielectric constant « at a distance x from its surface. Since x < 1/x, 
the screening effect of the ion cloud round the charge does not alter this 
energy. In the second term, (x) denotes the change (owing to the presence 
of the surface) in the field potential due to all the other ions in the solution. 
This term is unimportant here, however, since it disappears on substituting 
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(158.5)in (158.4) because of the electrical neutrality of the solution (Yc,z,= 0, 
and therefore }'z, dc, = 0). 

Thus, carrying out the integration in (158. yy we find 

(e— le? 

ele +t Iyo 4 ), oe q, 32a): 
The logarithmic divergence of the integral at both limits confirms the state- 
ment made above concerning the range of integration; we have naturally 
taken as the upper limit the screening length 1/x, and as the lower limit a 
quantity a, of the order of atomic dimensions, but different for the different 
kinds of ion. Since x” is proportional to the sum )°z’c,,' we see that the ex- 
pression obtained is a total differential and so can be integrated directly, 
giving 


da = 


_(e—1)e Aza 
_ : ————., 158. 
eee 8e(e+ 1)v Sexi y Coal as Cp 27 ee 


where a» is the surface tension of the pure solvent and the A, are dimensionless 
constants. 

This formula gives the solution of the problem. It should be noticed that 
the dissolution of a strong electrolyte increases the surface tension of the 
liquid. 


§ 159. Adsorption 


Adsorption in the restricted sense includes cases where the solute is con- 
centrated at the surface of a solid or liquid adsorbent,? and hardly any of it 
enters the volume of the adsorbent. The adsorbed film thus formed can be 
described by the surface concentration y, defined as the number of particles 
of the adsorbate (adsorbed substance) per unit surface area. At low pressures 
of the gas from which adsorption occurs, the concentration y must be pro- 
portional to the pressure;'! at high pressures, however, y increases less rapidly 
and tends to a limiting value corresponding to the formation of a mono- 
molecular layer with the adsorbate molecules closely packed together. 

Let y’ be the chemical potential of the adsorbate. By the same method as 
was used in § 96 for ordinary solutions we can derive for adsorption the 
thermodynamic inequality 

(0u'/Oy)r > 0, (159.1) 


t The expression for #? in a solution differs from (78.8) by a factor ¢ in the 
denominator. 

f For definiteness we shall consider adsorption from a gas phase. 

\! This rule is, however, not obeyed in practice for adsorption on a solid surface, 
since such a surface is never sufficiently homogeneous. 


35 


328 Surfaces 
which is entirely analogous to (96.7). From (157.1) we have 


= —(0a/du’)r = —(0a/dy)r(Oy/On)r, (159.2) 
and (159.1) therefore implies that 
(Gx /8y)r < 0, (159.3) 


i.e. surface tension decreases as the surface concentration increases. 
The minimum work which must be done to form the adsorbed film is equal 
to the corresponding change in the thermodynamic potential 2: 


Rmin = 3(%—%0), (159.4) 


where ao is the surface tension on the surface before adsorption. Hence, using 
(91.4), we find the heat of adsorption 


O= -9°( sr ar ad (159.5) 


The adsorbed film may be regarded as a kind of two-dimensional thermo- 
dynamic system, which may be either isotropic or anisotropic, despite the 
isotropy of the two volume phases.’ The question arises of the possible types 
of symmetry of the film. 

At the end of § 137, we have already mentioned that, although the exist- 
ence of a two-dimensional crystal lattice of arbitrarily great extent is impos- 
sible (since it is smoothed out by thermal fluctuations), a film may still show 
solid-crystal properties if it is of relatively small size. But the smoothed-out 
structure is still anisotropic, as noted at the end of § 138. Such an anisotropic 
film can have only an axis of symmetry perpendicular to its plane, and 
planes of symmetry passing through this axis. The only possible types of 
symmetry for such systems are therefore the point groups C, and C,,,. 

As with three-dimensional solids, different phases can exist in two- 
dimensional films. The equilibrium conditions for two phases in a film 
require the equality of their surface tensions as well as their temperatures 
and chemical potentials. The condition on the surface tensions corresponds 
to the condition of equal pressures in three-dimensional phases, and simply 
expresses the necessity for the forces exerted by each phase on the other to 
balance. 


+ Here we are considering adsorption on a liquid surface; adsorption on a solid 
surface is of no interest from this point of view, since, as mentioned above, such 
a surface is almost always inhomogeneous. 

It may be noted that anisotropy of the interface between two isotropic phases 
(liquid and vapour) of the same pure substance is also possible in principle. 
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§ 160. Wetting 


Let us consider adsorption on the surface of a solid from a vapour at a 
pressure close to the saturation value. The equilibrium concentration y is 
determined by the condition that the chemical potential of the adsorbate p’ 
is equal to that of the vapour y,. Various cases can occur according to the 
dependence of y’ on y. 

Let us suppose that the quantity of adsorbate gradually increases and the 
adsorbed layer becomes a liquid film of macroscopic thickness. The surface 
concentration y then becomes a conventionally defined quantity propor- 
tional to the film thickness /: y = ol/m, where m is the mass of a molecule 
and @ the density of the liquid. As the film thickness increases, the chemical 
potential of the substance forming it tends to ,, the chemical potential of the 
liquid in bulk. We shall measure the value of y’ (for given P and T) from this 
limiting value, i.e. write u’ +, in place of yu’; thus, by definition, yu’ -~ 0 as 
y +o, 

The chemical potential of the vapour can be written as u, = w(T)+ 
T log (P/Po), where Po(T) is the saturated vapour pressure; here we have 
used the fact that the saturated vapour is, by definition, in equilibrium with 
the liquid, ic. we must have wy, = pw, when P = P,.' The surface concentra- 
tion is determined by the condition w’ +p, = U,, OF 


u(y) = T log (P/P»). (160.1) 


If this equation is satisfied by several values of y, the one which corre- 
sponds toa stable state is that for which the potential Q,is a minimum. Taking 
the value per unit area of the surface, we obtain a quantity which may be 
called (in the general case of any film thickness) the “effective surface-tension 
coefficient” « at the solid-vapour boundary, and which takes into account 
the presence of the layer between them. Integrating the relation (159.2), we 
can write 


oo 


d - 
a(y) = | YG taro. (160.2) 


v 
The constant is so chosen that as y + o the function «(y) becomes the sum 
of the surface tensions at the “bulk”-phase (solid-liquid and liquid-gas) 
interfaces. 
It may also be recalled that a necessary condition for the thermodynamic 
stability of a state is the inequality (159.1), which is valid for any y. 


t The liquid itself is regarded as incompressible, i.e. we neglect the dependence 
of its chemical potential on the pressure. 
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Let us now consider some typical cases which may occur, depending on 
the nature of the function p(y). In the diagrams given below, the continuous 
curve shows the form of this function in the region of macroscopically thick 
films of liquid, while the broken curve is that for adsorbed films of “molec- 
ular” thickness. It is, of course, not strictly possible to represent the func- 
tion in these two regions in one diagram to the same scale, and to this extent 
the diagrams are a convention. 

In the first case shown (Fig. 76a) the function u’(y) decreases monotoni- 
cally with increasing y (i.e. with increasing film thickness) in the range of 
macroscopic thicknesses. For molecular dimensions the function y’(y) always 
tends to — co as T log y when y + 0, this law corresponding to a “weak 
solution” of the adsorbate on the surface. The equilibrium concentration is 
determined, according to (160.1), by the point of intersection of the curve 
with a horizontal line u’ = constant « 0. In this case, this can occur only at 
molecular concentrations, i.e. ordinary molecular adsorption must occur, 
as discussed in § 159. 
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If u’(y) increases monotonically but is everywhere negative (Fig. 76b), then 
in equilibrium a liquid film of macroscopic thickness is formed on the surface 
of the adsorbent. In particular, when the pressure P = Po (saturated va- 
pour), the film formed must be so thick that the properties of the substance 
in it do not differ from those of the liquid in bulk, i.e. the saturated vapour 
must be in contact with its own liquid phase. In that case we say that the 
liquid completely wets the solid surface in question. 

More complicated cases are also thoretically possible. For example, if the 
function y’(y) passes through zero and has a maximum (Fig. 76c) we have a 
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case of wetting but with formation of a film stable only at thicknesses below a 
certain limit. The film of finite thickness corresponding to the point A is in 
equilibrium with the saturated vapour. This state is separated from the other 
stable state (equilibrium of the solid wall with the bulk liquid) by a metastable 
region AB and a region of complete instability BC. 

The type of curve shown in Fig. 76d corresponds to a film which is unstable 
over a certain range of thickness. The line BF which cuts off equal areas BCD 
and DEF joins points B and F which have equal values of « (and equal y’), 
as is easily seen from (160.2). The branches AB and FG correspond to stable 
films; the range CE is completely unstable, while BC and EF are metastable. 

The two boundaries of the instability range (the points B and F) corre- 
spond in this case to macroscopic film thicknesses. Instability in the range from 
a certain macroscopic thickness to molecular thicknesses would correspond 
to a curve of the type shown in Fig. 76e, but such a curve would more likely 
lead to non-wetting, since the limit of stability would correspond to a point 
on BC where a horizontal line cuts off equal areas below the upper part and 
above the lower part of the curve. But this is usually impossible, since the 
latter area, which is related to the van der Waals forces (see below), is small 
compared with the former, which is related to the considerably greater forces 
at molecular distances. This means that the surface tension everywhere on 
BC is greater than that which would correspond to molecular adsorption on a 
solid surface, and the film will therefore be metastable. 

The chemical potential of the liquid film (measured from y,) represents the 
difference between the energy of the substance in the film and that in the 
bulk liquid. It is therefore clear that y’ is determined by the interaction forces 
between atoms at distances large compared with atomic dimensions and 
~ 1 (van der Waals forces). The potential u’(J) can be calculated in a general 
form, the result being expressed in terms of the permittivities of the solid 
wall and the liquid (see Part 2). 


§ 161. The angle of contact 


Let us consider three bodies in contact, solid, liquid and gas (or one solid 
and two liquid), distinguishing them by suffixes 1, 2, 3 respectively and denot- 
ing the surface-tension coefficients at the interfaces by «12, «13, «23 (Fig. 77). 

Three surface-tension forces act on the line where all three bodies meet, 
each force being in the interface between the corresponding pair of bodies. 
We denote by @ the angle between the surface of the liquid and the plane 
surface of the solid, called the angle of contact. The value of this angle is 
determined by the condition of mechanical equilibrium: the resultant of the 
three surface-tension forces must have no component along the surface of the 
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solid. Thus «13 = «12-+a23 cos 0, whence 
cos 9 = (%13—012)/H23. (161.1) 


If x13 > a19, i.e. the surface tension between the gas and the solid is greater 
than that between the solid and the liquid, than cos 9 > 0 and the angle of 
contact is acute, as in Fig. 77. If «13 < «12, however, the angle of contact is 
obtuse. 

From the expression (161.1) we see that in any actual case of stable contact 
the inequality 


|%1g—1a! << wag (161.2) 


must necessarily hold, since otherwise the condition of equilibrium would 
lead to an imaginary value of the angle 0, which has no meaning. On the 
other hand, if «12, «13, «23 are regarded as the values of the corresponding 
coefficients for each pair of bodies by themselves, without the third one, then 
it may well happen that the condition (161.2) is not satisfied. Actually, how- 
ever, it must be remembered that when three different substances are in 
contact there may in general be an adsorbed film of each substance on the 
interface between the other two, and this lowers the surface tension. The 
resulting coefficients « will certainly satisfy the inequality (161.2), and such 
adsorption will necessarily occur if the inequality would not be satisfied 
without it. 

If the liquid completely wets the solid surface, then a macroscopically thick 
liquid film, not an adsorbed film, is formed on the surface. The gas will 
therefore be everywhere in contact with the same liquid substance, and the 
surface tension between the solid and the gas is not involved at all. The con- 
dition of mechanical equilibrium gives simply cos 6 = 1, i.e. the angle of 
contact is zero. 

Similar arguments are valid for contact between three bodies of which 
none is solid: a liquid drop (3 in Fig. 78) on the surface of another liquid (1) 
adjoining a gas (2). In this case the angles of contact 0; and 02 are determined 
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Fic. 78 


by the vanishing of the resultant of the three surface-tension forces, i.e. of 
their vector sum: 


12+ 13+Ge3 = 0. (161.3) 


Here it is evident that none of the quantities «12, «13, %e3 can be greater than 
the sum or less than the difference of the other two. 


§ 162. Nucleation in phase transitions 


If a substance is in a metastable state, it will sooner or later enter another 
state which is stable. For example, supercooled vapour in time condenses to 
a liquid; a superheated liquid is converted into vapour. This change occurs 
in the following manner. Owing to fluctuations, small quantities of a new 
phase are formed in an originally homogeneous phase; for example, droplets 
of liquid form in a vapour. If the vapour is the stable phase, these droplets are 
always unstable and eventually disappear. If the vapour is supercooled, how- 
ever, then when the droplets formed in it are sufficiently large they are stable 
and in time begin to grow and form a kind of centre of condensation for the 
vapour. The droplets must be sufficiently large in order to compensate the 
unfavourable energy change when a liquid-vapour interface is formed." 
Thus there is a certain minimum or critical size necessary for a nucleus, 
as it is called, of a new phase formed in a metastable phase, in order for it to 
become a centre for formation of the new phase. Since one phase or the 
other is stable for sizes less than and greater than the critical, the “critical 
nucleus” is in unstable equilibrium with the metastable phase. In what 
follows we shall discuss the probability that such nuclei occur.? Because of the 


t It should be borne in mind that this mechanism of formation of a new phase 
can actually occur only in a sufficiently pure substance. In practice, the centres 
of formation of the new phase are usually various kinds of “impurity” (dust 
particles, ions, etc.). 

t The calculation of the probability that nuclei of any size occur is given in 
Problem 2, and illustrates the relationships described. 
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rapid decrease in the probability of fluctuations of increasing size, the begin- 
ning of the phase transition is determined by the probability that nuclei of 
just this minimum necessary size occur. 

Let us consider the formation of nuclei in isotropic phases: the formation 
of liquid droplets in supercooled vapour, or of vapour bubbles in superheat- 
ed liquid. A nucleus may be regarded as spherical, since, owing to its very 
small size, the effect of gravity on its shape is entirely negligible. For a nu- 
cleus in equilibrium with the surrounding medium we have, from (156.2), 
P’—P = 2a/r, and the radius of the nucleus is therefore 


rer = 2a[(P’—P)3 (162.1) 


the primed and unprimed letters everywhere refer to the nucleus and to the 
main (metastable) phase respectively. 

According to the general formula (112.1), the probability w of a fluctua- 
tion producing a nucleus is proportional to exp (— R,;,/7'), where R,,;, is the 
minimum work needed to form the nucleus. Since the temperature and chem- 
ical potential of the nucleus have the same values as in the surrounding 
medium (the main phase), this work is given by the change in the potential 2 
in the process. Before the formation of the nucleus, the volume of the meta- 
stable phase was V +V’ and its potential Q = -P(V+V’); after the forma- 
tion of the nucleus of volume V’, the potential 2 of the whole system is 
—PV —P’V’ +«8. We therefore have 


Rmin = —(P’— PV’ +08. (162.2) 


For a spherical nucleus V’ = 4r3/3 and 8 = 4zr?, and replacing r by (162.1) 
we find 
Rin = 162%08/3(P’—P)*. (162.3) 


As in§ 156, we denote by Po the pressure of both phases (at a given tem- 
perature T) when the interface between them is plane; in other words, Pp is 
the pressure for which the given value of T is the ordinary phase transition 
point, from which the superheating or supercooling is measured. If the 
metastable phase is only slightly superheated or supercooled, the differ- 
ences 6P = P—Po, SP’ = P’—Pp are relatively small and satisfy the equa- 
tion (156.4): 

v 6P’ = v oP, (162.4) 


where v’ and v are the molecular volumes of the nucleus and the metastable 
phase. Replacing P’ —P by 6P’ — 6P in (162.3) and expressing 6P” in terms of 
6P from (162.4), we find for the probability of formation of a nucleus in a 
slightly superheated or supercooled phase 

1670034’? 


wo exp {- 3T(~—vy (OPY (162.5) 


§ 162 Nucleation in Phase Transitions 535 


In the formation of vapour bubbles in a superheated liquid we can neglect 
v in this formula in comparison with v’: 


Lore \ (162.6) 


w oc exp | —araPF 


In the formation of liquid droplets in a supercooled vapour we can neg- 
lect v’ in (162.5) in comparison with v, and substitute v = 7/P = T/Pp. This 
gives 

167003 v’2P2 
wo exp | ~—Srare (162.7) 


The degree of metastability can be defined by the difference 67 = T—T) 
between the temperature T of the metastable phase (with which the nucleus 
is in equilibrium) and the temperature 75 of equilibrium of the two phases 
when the interface is plane, instead of by 6P. According to the Clapeyron— 
Clausius formula, 67 and 6P are related by 


me q 
6P = Too) oT, 


where q is the molecular heat of the transition from the metastable phase to 
the nucleus phase. Substituting 6P in (162.5), we obtain for the probability 
of formation of a nucleus 


HOrF} (162.8) 


we exp | Ser . 


If saturated vapour is in contact with a solid surface (the wall of a vessel) 
which is completely wetted by the liquid, condensation of the vapour will 
occur without nucleation, directly on this surface. The formation of a liquid 
film on the solid surface in this case does not require work to be done to form 
the interface, and so the existence of a metastable phase (supercooling of 
the vapour) is impossible. 

For the same reason, superheating of a solid with a free surface is in gen- 
eral impossible. This is because usually liquids completely wet the surface of 
a solid phase of the same substance; consequently, the formation of a liquid 
layer on the surface of a melting body does not require work to be done to 
form a new surface. 

The formation of nuclei within a crystal on melting can, however, occur if 
the necessary conditions of heating are maintained: the body must be heated 
internally and its surface kept at a temperature below the melting point. The 
probability of formation of nuclei then depends on elastic deformations 
accompanying the creation of liquid droplets within the solid. 
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PROBLEMS 


PROBLEM 1. Determine the probability of formation of a nucleus of a liquid on a 
solid surface for a given (non-zero) value of the angle of contact 0. 


SOLUTION. The nucleus will have the shape of a segment of a sphere with base 
radius r sin 0, r being the radius of the sphere. Its volume is V= 47P(1 —cos 6)? 
(2+-cos 6), and the surface areas of the curved part and the base are respectively 
2z7r°(1—cos 6) and zr? sin? 6. Using the relation (161.1) for the angle of contact, 
we find that the change in 2, on formation of the nucleus is 


oa + 2atr?(1—cos 9)—« cos O-str? sin? 0 = asr*(1—cos 0)? (2+-cos 6), 


where « is the surface-tension coefficient between the liquid and the vapour. This 
change in 2, is the same as would occur in the formation, in the vapour, of a 
spherical nucleus of volume V and surface tension 


—cos 6\ 28 
van = = (2+cos 6)13, 


Accordingly, the required formulae for the formation of nuclei are obtained from 
those derived in the text on replacing « by Gefr. 


PROBLEM 2. Find the probability of formation of a nucleus of arbitrary dimen- 
sions. 


SOLUTION. We regard the metastable phase as an external medium containing the 
nucleus, and calculate the work of formation of the nucleus from formula (20.2): 
Rin = A(E—TyS+PoV) or, since in this case the process occurs at constant tem- 
perature equal to the temperature of the medium, Rin = A(F+PoV). To determine 
this quantity, it is sufficient to consider only the amount of substance which enters 
the other phase (since the state of the remaining substance in the metastable phase 
remains unchanged). Again denoting the quantities pertaining to the substance 
in the original and the new phase by unprimed and primed letters respectively, 
we have 


Ruin = [F'(P)+-PV’ +08]—[F(P)+PV] = ®(P')—®P)—(P’—P)V'+03; (1) 
for a nucleus in unstable equilibrium with the metastable phase we should have 
@’(P’) = ®(P) and thus return to (162.2). 


Assuming the degree of metastability to be small, we have ®’(P’)) = ®’(P)+ 
(P’—P)V’, and (1) thus reduces to 


Renin = nlp/(P)—w (PY) +238, 


where n = V’/v’ is the number of particles in the nucleus. For a spherical nucleus, 


Rin = [WP)— HPN 40. Q) 


In the metastability range, w(P) > p’(P) and hence the first term (the volume term) 
is negative. The expression (2) may be said to describe the potential barrier which 
has to be overcome for the formation of a stable nucleus. It has a maximum at 


r= Pry = 2nv'/[u(P)—p(P)), 
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corresponding to the critical radius of the nucleus. For r < r,, a decrease of r is 
energetically favourable and the nucleus is absorbed; for r > rg, an increase of 
r is favourable and the nucleus grows.t 


§ 163. The impossibility of the existence of phases in one-dimensional systems 


A problem of theoretical interest is that of the possibility of existence of 
more than one phase in one-dimensional (linear) systems, i.e. those in which 
the particles lie along a line. The following argument shows that in fact 
thermodynamic equilibrium between two homogeneous phases in contact 
at a single point and having arbitrarily large extent in length is not possible 
(L. D. Landau, 1950). 

To prove this, let us imagine a linear system consisting of an alternation of 
sections formed by two different phases. Let ®p be the thermodynamic po- 
tential of this system, without allowance for the existence of points of con- 
tact between different phases, i.e. the thermodynamic potential of the total 
amounts of the two phases without regard to their division into sections. 
To take into account the effect of the points of contact, we note that the 
system may be formally regarded as a “solution” of these points in the two 
phases. If the “solution” is weak, the thermodynamic potential ® of the 


system will be 
® = Do+nT log (n/eL) + ny, 


where n is the number of points of contact in a length L. Hence 
0@/0n = T log (n/L) +. 


When the “concentration” n/L is sufficiently small (i.e. for a small number of 
sections of different phases), log(n/L) is large and negative, and therefore 
0®/On < 0. Thus ® decreases with increasing n, and since ® must tend to a 
minimum n will increase (until the derivative 6@/6n becomes positive). That 
is, the two phases will tend to intermingle in shorter and shorter sections, 
and therefore cannot exist as separate phases. 


t The calculation of Rin for r= Por naturally leads to formula (162.5) if we 
note that under these conditions u(P)—p'(P) = (v— +) OP. 
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